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INHIBITION OF GENE TRANSCRIPTION BY POLYAMIDE DNA-BINDING LIGANDS 

5 

The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM 
26453, 27681, 47530 and AI 29182 awarded by the National Institute of Health. 

10 CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Provisional Applications S.N. 
60/038,384, filed February 14, 1997, S.N. 60/038,394, filed February 14, 1997, S.N. 60/[CIT 
2683], filed September 2, 1997, and S.N. 60/[CIT 2684], filed September 10, 1997, which are 
incorporated by reference, U.S. application 08/853,022, filed April 21, 1997, and 
15 PCT/US97/12722, filed July 21, 1997. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

20 This invention relates to polyamides that bind to predetermined sequences in the minor 

groove of double stranded DNA that are useful for diagnosis and treatment of diseases 
associated with gene transcription. This invention is related to modulation of cellular or viral 
gene expression required for maintenance and replication of pathogens in infectious disease, 
such as HTV-1 and CMV. This invention is also related to modulation of cellular gene 

25 expression in non-infectious disease conditions, such as cancers involving oncogenes, e.g., her- 
2/neu. 

Gene Therapy Approaches for HIV: 

30 Considerable effort has been expended over the past decade to devise methods to 

interfere with HIV-1 gene expression in living cells in the hope that therapeutic strategies will 
come from these studies (recently reviewed in Kohn, D.B. and N. Sarver, Gene therapy for 
HIV-1 infection, in Antiviral Chemotherapy, J. Mills, P.A. Volberding, and L. Corey, Editors. 
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1996, Plenum Press: New York. p. 421-427.). One approach includes interference with the 
translation of messenger RNA into protein by the introduction of antisense oligonucleotides 
into lymphoid cells, as discussed in Kohn, D.B. and N. Sarver, Gene therapy for HIV-1 
infection, in Antiviral Chemotherapy, J. Mills, P.A. Volberding, and L. Corey, Editors. 1996, 
Plenum Press: New York. p. 421-427; Bordier, B., et at, Proc. Natl. Acad. Sci. U.S.A., 92: 
9383-9387 (1995) and Lisziewicz, J., et al. , Proc. Natl. Acad. Sci. U.S.A., 91: 7942-7946 
(1994). 

Another approach involves ribozyme-mediated destruction of specific regions of HIV-1 
mRNA. See Sun, L.Q., et al., Proc. Natl. Acad. Sci. U.S.A., 92: 7272-7276 (1995); Yamada, 
O., et al. , J. Virol., 70: 1596-1601 (1996) and Zhou, C, et al. , Gene, 149: 33-39 (1994). 
Decoy molecules, corresponding to HIV-1 RNA domains that bind regulatory proteins required 
for the HIV-1 life cycle (TAR RNA which binds Tat or the Rev-response element) have been 
used as inhibitors of HIV-1 replication (Sullenger, B.A., et al. , Cell, 63: 601-608 (1990). In 
addition, trans-dominant mutant versions of these regulatory proteins, introduced into cells 
with retroviral expression vectors, have been shown to inhibit HIV-1 replication (Bevec, D., et 
al. , Proc. Natl. Acad. Sci. U. S. A., 89: 9870-9874, 1992.). 

Other approaches for direct inhibition of gene transcription, including designed or 
selected zinc finger peptides that recognize pre-determined DNA sequences, are described in 
Wu, H., et al., Proc. Natl. Acad. Sci. U.S.A., 92: 344-348 (1995) and Thiesen, H.-J., Gene 
Expr., 5: 229-243 (1996). DNA-cleaving ribozymes have also been tried (Raillard, S.A. and 
G.F. Joyce, Biochemistry, 35: 11693-11701(1996)). Triple helix-forming oligonucleotides 
have been used to block HIV-1 integration: Bouziane, M., et al.., J. Biol. Chem., 271: 10359- 
10364 (1996). Triple helix-forming oligonucleotides have also been used specifically cleave 
HTV-1 DNA with a metalloporphyrin group attached to the oligonucleotide, as described by 
Bigey, P., G. Pratviel, and B. Meunier, Nucleic Acids Res., 23: 3894-3900 (1995). 
Additionally, the DNA-binding calicheamicin oligosaccharides have the potential for use in 
anti-HIV-1 therapy but have not as yet been applied to this disease. See Ho, S.N., et al, Proc. 
Natl. Acad. Sci., 91: 9203-9307 (1994) and Liu, C, et al, Biochemistry, 93: 940-944 (1996). 

For any gene therapy approach to be successful, several criteria must be met by the 
therapeutic agent: First, the agent must not possess any general cell toxicity and should not 
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elicit an immune response. Second, the agent must be cell-permeable or amenable to viral 
delivery and, in the case of the DNA -binding agents, the therapeutic agent must transit to the 
nucleus and bind the target sequence with high affinity and specificity in the context of cellular 
chromatin. Third, binding of the agent to its DNA or RNA target sequence must interfere with 
gene transcription or protein translation. 

Each of the potential approaches listed above has its own unique advantages and 
limitations. For example, while nucleic acid-based approaches (antisense, decoy and triple 
helix-forming oligonucleotides and ribozymes) have the potential for sequence selectivity and 
can effectively inhibit transcription or translation in vitro, these molecules suffer from poor cell 
permeability and other delivery systems, such as retroviral vectors in the case of the ribozymes 
(Zhou, C, et al, 1994) or liposomes or other delivery strategies in the case of antisense or 
triple helix oligonucleotides, must be used for effective gene inhibition (reviewed in Kohn & 
Sarver, 1996). Similarly, zinc finger peptides must be introduced via a gene therapy approach 
with an appropriate viral expression vector since these peptides cannot directly enter cells. See 
Choo, Y, et al , Nature, 372: 642-645 (1994). 

One additional problem with gene therapy approaches is that they must be performed 
on lymphoid cells ex vivo and, once an "HTV-protected" cell population is established, these 
cells must then be introduced into the patient. 

In contrast to gene therapy approaches, HIV protease inhibitors taken in combination 
with standard anti -retroviral agents (AZT) have recently shown success in clinical trials. Wei, 
X. et al, Nature, 373: 117-122 (1995); Ho, D.D. et al, Nature, 373: 123-126 (1995). 

The key to the anti-HIV properties of these drugs is that they strike at two separate 
phases of the virus life cycle, limiting the ability of spontaneous mutations to result in 
inhibitor-resistant strains of the virus. Small molecule inhibitors of HIV- 1 RNA transcription 
which would target a third phase of the virus life cycle would be highly desirable. Cell- 
permeable sequence-specific DNA-binding ligands would circumvent the problems associated 
with other forms of gene therapy and could compliment the protease inhibitor-anti-retroviral 
agent cocktail approach mentioned above. The calicheamicin oligosaccharides satisfy some of 
the requirements for a therapeutic agent; these molecules are sufficiently hydrophobic to pass 
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through cell membranes but these molecules possess severely limited sequence specificity (4 
base pairs) and bind DNA with very low affinities (100 uM or higher required for inhibition of 
protein-DNA interactions . See Ho, S.N., et al., Proc. Natl. Acad. Sci., 91: 9203-9307 (1994) 
and Liu, C, et al, Biochemistry, 93: 940-944 (1996)). 

Thus, new classes of cell-permeable molecules that possess higher degrees of DNA 
sequence specificity and affinity are needed for the treatment of AIDS and other infectious 
diseases. We describe below the successful development of a new class of highly specific 
designed small molecule ligands with great potential for inhibition of HTV-l gene transcription. 



The HIV-1 Enhancer and Promoter: 



A recent review has summarized our current knowledge of the protein factors required 
for the control of RNA initiation and elongation by RNA polymerase II at the HTV-l promoter 
(Jones, K.A. and B.M. Peterlin. 1994. Control of RNA initiation and elongation at the HTV-l 
promoter. Annu. Rev. Biochem.. 63: 717-743). Thus only those aspects of HIV-1 transcription 
that relate to transcription inhibition are discussed herein. For HTV, the template for synthesis 
of both new viral RNA and messenger RNA (for viral protein synthesis) is the integrated 
provirus, the product of reverse transcription of the viral RNA in the infected cell. HIV-1 
utilizes the transcription machinery of the host cell but encodes its own /ra/w-activators, Tat 
and Rev, that are responsible for RNA elongation and utilization. The HTV-l promoter is 
located in the IB region of the leftward (5') long terminal repeat (see Figure 11 below, taken 
from Jones and Peterlin, 1994). The core promoter and enhancer elements span a region of 
approximately 250 base pairs and include TATA and initiator elements and the binding sites 
for the following cellular transcription factors: Spl, NF-kB, LEF-1, Ets-1 and USF. 
Sequences upstream of the NF-kB sites contribute only marginally to HTV-l promoter activity 
either in vitro or in transfected cell lymphoid cell lines. Waterman, M.L. and K.A. Jones, New 
Biologist, 2: 621-636 (1990). However, these upstream sequences, and presumably the protein 
factors which bind these upstream sequences, are important for viral replication, and hence 
promoter activity, in peripheral blood lymphocytes and in some T cell lines. Kim, J., et 
al,. J. Virol, 67: 1658-1662 (1993). 
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Two of the binding sites in the upstream region correspond to recognition sites for 
activator proteins that are lymphoid cell specific (or highly enriched in T cells) and are shared 
with the promoter of the T cell receptor (TCRa) gene: these are the Ets-1 and LEF-1 
transcription factors. The essential role of the upstream region has recently been reproduced in 
5 vitro with a chromatin reconstitution assay (Sheridan, P.L., et al, Genes Dev., 9: 2090-2104 
(1995)). 

Packaging of the HIV-l promoter into nucleosomes strongly repressed transcription, 
but this repression could be relieved by pre-incubation of the template with the HIV-l 

10 enhancer-binding proteins, LEF-1 and Ets-1. LEF-1 and Ets-1 thus apparently act in concert to 
prevent nucleosome-mediated repression in vivo. Inhibition of formation of this complex by 
small molecule inhibitors could well represent a viable target for HIV-l gene therapy. LEF-1 
is a member of the HMG family of proteins and binds DNA as a monomer. DNA binding is in 
the minor groove and results in a large distortion of the DNA helix (unwinding and bending) 

15 (Love, J.J., et al, Nature, 376: 791-795 (1995)). 

In addition to acting as an architectural transcription factor, LEF-1 possesses a strong 
/raws-activation domain which can function when artificiality transferred to other DNA- 
binding proteins (Giese, K., et ai, Genes Dev., 9: 995-1008 (1995)). This region of the HIV-l 
20 enhancer might thus prove to be an effective target for inhibition of viral transcription and 
hence virus replication. 

The HIV-l promoter also contains tandem binding sites for NF-kB, a factor that is 
strongly induced by HIV infection (Bachelerie, F., et al Nature, 350: 709-712 (1991)) and 

25 multiple binding sites for the general transcription factor Spl. The mechanisms of NF-kB 
activation have been reviewed by Jones and Peterlin, 1994. Important for this discussion, NF- 
kB has been shown to contact both Spl and the TBP subunit of the basal transcription factor 
TFIID. Perkins, N.D., et al, Mol Cell. Biol, 14: 6570-6583 (1994). Additionally, Spl has 
been shown to interact with the TAF110 subunit of TFIID (HOkDa TBP-associated factor) 

30 (Chen, J.L., et al, Cell, 79: 93-105, 1994). It is the binding of TFIID via the TBP interaction 
with the TATA element that nucleates the assembly of the complete RNA polymerase II 
transcription complex (reviewed in Maldonado, E. and D. Reinberg, Current Opinion in Cell 
Biology, 7: 352-361, 1995). Thus, NF-kB may function through recruitment of Spl and TFIID 
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to the H1V-1 promotor via these protein-protein interactions. Thus blocking the NF-kB-DNA 
and/or Spl-DNA interaction is another potential target for HIV therapy. Since these factors, 
and especially Spl and TFIID, are utilized by a wide range of cellular genes, the binding sites 
for these factors would not be good targets for HIV-specific inhibition (or any gene-specific 
inhibition). However, the sequences adjacent to these sites, that are unique to HIV-1 proviral 
DNA, are excellent candidate targets for the design of inhibitory DNA ligands (see below). 



Polyamide DNA-binding Ligands: 

Simple rules have been developed to rationally determine the sequence-specificity of 
minor-groove-binding polyamides containing jV-methylimidazole and N-methylpyrrole amino 
acids. These synthetic pyrrole-imidazole polyamides bind DNA with excellent specificity and 
very high affinities, even exceeding the affinities of many sequence-specific transcription 
factors (Trauger et al.,. Nature, 382: 559-561, 1996). An Im/Py pair distinguishes G»C from 
OG and both of these from A»T or T»A base pairs. Wade, W.S., et al. describes the design of 
peptides that bind in the minor groove of DNA at 5*-WGWCW-3' sequences (where W is 
either A or T, and a W»W pairs is an A«T or a T»A base pairs by a dimeric side-by-side motif. 
J. Am. Chem. Soc. 114, 8783-8794 (1992); Mrksich, M. et al describes antiparallel side-by- 
side motif for sequence specific-recognition in the minor groove of DNA by the designed 
peptide l-methylimidazole-2-carboxamidenetropsin. Proc. Natl. Acad. Sci. USA 89, 7586- 
7590 (1992); Trauger, J.W., et al., describes the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996). A Py/Py pair specifies A»T from 
G»C but does not distinguish A«T from T«A. Pelton, J.G. & Wemmer, D.E. describes the 
structural characterization of a 2-1 distamycin A-d(CGCAAATTTGGC) complex by two- 
dimensional NMR. Proc. Natl Acad. Sci. USA 86, 5723-5727 (1989); White, S., et al. 
Biochemistry 35, 12532-12537 (1996) describes the effects of the A»T/T»A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA, the pairing rules for 
recognition in the minor groove of DNA by pyrrole-imidazole polyamides, and also describes 
the 5'-3' N-C orientation preference for polyamide binding in the minor groove. 

It has been found that a new aromatic amino acid, 3-hydroxy-N-methylpyrrole (Hp) 
when incorporated into a polyamide and paired opposite Py, provides the means to discriminate 
A.T from T»A. White, S., et al. Nature 391 436-438 (1998). Unexpectedly, the replacement 
of a single hydrogen atom on the pyrrole with a hydroxy group in a Hp/Py pair regulates the 
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affinity and the specificity of a polyamide by an order of magnitude. Utilizing Hp together 
with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im, Hp/Py, 
and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor groove 
ofDNA. 

5 

The preferred corresponding designed specific polyamides resulting from this invention 
are of the form 

X1X2 . . . X m -y-X( m + i) . . . X(2m-l)X2nrP-Dp 

10 wherein Xi, X2, X m , X( nl + i), X(2m - 1), and X2m are carboxamide residues 

forming carboxamide binding pairs X\/X2m 9 X2/X(2m-1> X m /X( m + i), and y is y- 
aminobuytic acid or 2,4 diaminobutyric acid and Dp is dimethylaminopropylamide, 
and where 

carboxamide binding pair Xi/X2m corresponds to base pair Ni»N'i, 
15 carboxamide binding pair X2/X(2m-1) corresponds to base pair N2»N'2, 

carboxamide binding pair X m /X( m +1) corresponds to base pair N m «N' m . 

In general, the specific polyamide DNA-binding ligands were designed by using a 
method that comprises the steps of identifying the target DNA sequence 5'-WNiN2 
20 N m W-3'; representing the identified sequence as 5'-Wa& . . • xW-3', wherein a is a first 
nucleotide to be bound by the X ] carboxamide residue, b is a second nucleotide to be bound by 
the X2 carboxamide residue, and x is the corresponding nucleotide to be bound by the X m 
carboxamide residue; defining a as A, G, C, or T to correspond to the first nucleotide to be 
bound by a carboxamide residue in the identified six base pair sequence. 

25 

Carboxamide residues were selected sequentially as follows: Im was selected as the Xi 
carboxamide residue and Py as the X2m carboxamide residue if a was G. Py was selected as 
the Xi carboxamide residue and Im as the X2m carboxamide residue if a was C. Hp was 
selected as the Xi carboxamide residue and Py as the X2m carboxamide residue if a was T. 
30 Py was selected as the X] carboxamide residue and Hp as the X2m carboxamide residue if a 
was A. 
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The remaining carboxamide residues were selected in the same fashion. Im was 
selected as the X2 carboxamide residue and Py as the X2m-1 carboxamide residue if b was G. 
Py was selected as the X2 carboxamide residue and Im as the X2m-1 carboxamide residue if b 
was C. Hp was selected as the X2 carboxamide residue and Py as the X2m-1 carboxamide 
residue if b was T. Py was selected as the X2 carboxamide residue and Hp as the X2m-1 
carboxamide residue if b was A. 

The selection of carboxamide residues was continued through m iterations. In the last 
iteration, Im was selected as the X m carboxamide residue and Py as the X m +l carboxamide 
residue if x was G. Py was selected as the X m carboxamide residue and Im as the X m +1 
carboxamide residue if x was C. Hp was selected as the X m carboxamide residue and Py as 
the X m +l carboxamide residue if x was T. Py was selected as the X m carboxamide residue 
and Hp as the X m +l carboxamide residue if x was A. 

In one preferred embodiment, the polyamide includes at least four consecutive 
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another 
preferred embodiment, the polyamide includes at least five consecutive carboxamide pairs for 
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred 
embodiment, the polyamide includes at least six consecutive carboxamide pairs for binding to 
at least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved 
polyamides have four carboxamide binding pairs that will distinguish A*T, T*A, OG and G*C 
base pairs in the minor groove of a duplex DNA sequence. 

DNA target sequence recognition thus depends on a code of side-by-carboxamide 
residue pairings in the minor groove of double stranded DNA. These compounds represent the 
only class of synthetic small molecules that can bind predetermined DNA sequences with 
affinities and specificities comparable to DNA-binding proteins. 
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Summary Of The Invention 

This invention provides specific polyamides that are useful for modulating the 
expression of a cellular or viral gene by binding to predetermined target sequences adjacent to 
5 the binding site of a transcription factor protein in the minor groove of double stranded DNA. 
Suitable cellular genes include both eukaryotic and prokaryotic genes. The cellular gene can 
be present in the original native cells, in cells transfected or transformed with a recombinant 
DNA construct comprising the cellular gene or in an in vitro in a cell-free system. The viral 
gene can be present in a cell or in an in vitro in a cell-free system. 

10 

The polyamides of the present invention can act as specific inhibitors of gene 
transcription in vivo or in vitro as therapeutic agents in disease conditions related to the 
transcription of at least one cellular or viral gene. Studies with three accepted model systems 
have shown that polyamides do interfere with the binding of sequence-specific minor groove 
15 transcription factor proteins as well as with components of the basal transcription machinery 
and thus block transcription of target genes. 

Hereinafter, N-methylpyrrole carboxamide may be referred to as "Py", N- 
methylimidazole carboxamide may be referred to as "Im", 3-hydroxy-N-methylpyrrole 
20 carboxamide may be referred to as "Hp", y-aminobutyric acid may referred to as "y", p-alanine 
may be referred to as "(J", glycine may be referred to as "G", dimethylaminopropylamide may 
be referred to as "Dp", and ethylenediaminetetraacetic acid may be referred to as "EDTA". 

The invention encompasses polyamides having y-aminobutyric acid or a substituted y- 
25 aminobutyric acid to form a hairpin with a member of each carboxamide pairing on each side 
of it. Preferably the substituted y-aminobutyric acid is a chiral substituted y-aminobutyric acid 
such as (R)-2,4-diaminobutyric acid. In addition, the polyamides may contain an aliphatic 
amino acid residue, preferably a p-alanine residue, in place of a Hp or Py carboxamide. The p- 
alanine residue is represented in formulas as p. The p-alanine residue becomes a member of a 
30 carboxamide binding pair. The invention further includes the substitution as a p/p binding pair 
for non-Im containing binding pair. Thus, binding pairs in addition to the Im/Py, Py/Im, Hp/Py 
and Py/Hp are Im/p, p/Im, Py/p, p/Py, Hp/p, p/Hp, and p/p. 
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The polyamides of the invention can have additional moieties attached covalently to the 
polyamide. Preferably the additional moieties are attached as substituents at the amino 
terminus of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4- 
diaminobutyric acid residue. Suitable additional moieties include a detectable labeling group 
such as a dye, biotin or a hapten. Other suitable additional moieties are DNA reactive moieties 
that provide for sequence specific cleavage of the duplex DNA. 

A central aspect of the present invention is the use of unique or rare sequences adjacent 
to the binding sites for common transcription factors as the target sequences for the design of 
polyamides. It has been found that (1) sequences adjacent to the binding sites for required 
transcription factors are unique to the genes under study and are not found in other genes in the 
current nucleic acid data bases; (2) polyamides targeted to these sequences are effective 
inhibitors of protein-DNA interactions; (3) such polyamides are inhibitors of transcription 
factor-dependent gene transcription in vitro; and (4) the polyamides are cell permeable agents 
and have been shown to inhibit transcription of target genes in human cells in culture. 

Most importantly, several designed polyamides have been shown to inhibit 
transcription of specific genes in vivo and thus these compounds must be both cell permeable 
and once inside the cell they must be able to transit the nuclear envelope and bind their target 
sites within chromatin (Gottesfeld, J.M., et al, Nature, 387: 202-205, 1997). These results 
demonstrate that designed pyrrole-imidazole polyamides are useful in the treatment of diseases, 
particularly viral diseases, including AIDS, as well as many other diseases for which specific 
candidate gene targets have been identified. 

The present invention provides specific polyamides which inhibit the transcription of 
DNA upstream or downstream of transcriptional factors such as the 5S RNA gene 
transcriptional factor TFIIIA, the minor groove-binding protein TATA-box binding protein 
(TBP), Ets-1 and the lymphoid enhancer factor LEF-1 protein. These polyamides act as gene- 
specific inhibitors of transcription since these polyamides are selective for the sequences 
flanking these protein binding sites that are, in turn, gene-specific. The polyamides are useful 
as therapeutics for the treatment of cancer as well as for the treatment of diseases caused by 
viruses and other pathogens (such as bacterial, fungal, etc.) 
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The present invention provides a composition comprising a transcription inhibiting 
amount of at least one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-p- 
Dp, ImPy-P-ImPy-y-ImPy-P-ImPy-fi-Dp and mixtures thereof and a pharmaceutically 
acceptable excipient suitable for the treatment of HIV-1 infection. The invention also provides 

5 a method of treating a human patient with an HIV-1 infection comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least one 
polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-P-Dp, ImPy-P-ImPy-y- 
ImPy-P-ImPy-p-Dp and mixtures thereof and a pharmaceutically acceptable excipient. 
Preferably a transcription inhibiting amount corresponds to an extracellular concentration of 

10 polyamide of about 100 nanomolar to about 10 micromolar. In one preferred embodiment, a 
transcription inhibiting amount corresponds to an extracellular concentration of about one 
micromolar to about ten micromolar ImPyPyPy-y-ImPyPy-p-Dp mixed with about one 
micromolar to about ten micromolar ImPy-p-ImPy-y-ImPy-P-ImPy-P-Dp. 

15 The present invention provides methods of treating cells in vitro as well as treating a 

human patient or a non-human organism in vivo. In one preferred embodiment, the invention 
provides a method of treating HIV-1 infected human blood cells in vitro comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least one 
polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-P-Dp, ImPy-P-ImPy-y- 

20 ImPy-P-ImPy-P-Dp and mixtures thereof. 

In other embodiments, the invention provides a diagnostic kit for detecting the 
identified target DNA sequence by employing the selective polyamides and a suitable system 
for the detection of the polyamide bound to the DNA. 

25 

Brief Description of the Drawings 

Figure 1 is a representation of a schematic model for the interaction of the nine zinc 
finger protein TFII1A with the 5S ribosomal RNA gene internal control region (ICR). 

30 

Figure 2 is a representation of the results of DNase I footprinting analysis of the 
binding of polyamide 1 and TFIIIA to the 5S RNA gene ICR. 
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Figure 3 is a representation of the results of experiments demonstrating inhibition of 5S 
RNA gene transcription in vitro. 

Figure 4 is a representation of the results of an experiment demonstrating the inhibition 
of 5S RNA gene transcription complex formation in vivo. 

Figure 5 is a schematic representation of the HIV-1 genome that encodes Gag, Pol, 
Env and six regulatory proteins as well as anticipated formally matched complexes of 1 and 4, 
and formally mismatched complexes of 2, 3, 5, and 6 with the HIV-1 target sequence 5*- 
TGCTGCA-3\ 

Figure 6 is a schematic representation of the structures of 5-y-5 polyamides 1-3, and 2- 
P-2-y-2-p-2 polyamides 4-6. 

Figure 7 is a schematic representation of part of the sequence of the restriction fragment 
used in quantitative DNase I footprinting experiments. The three 7 base pair target sites 5'- 
TGCTGCA-3' (the HIV-1 target site), 5'-TGGTGGA-3', and 5'-TGTTACA-3' are 
highlighted. 

Figure 8 is a representation of the results of a footprint experiment demonstrating the 
binding of polyamides 4-6 and 3. 

Figure 9 is a representation of the results of a MPE-Fe 11 footprint experiment using 
polyamides 4, 5, 6, and 3 at 1 nM. 

Figure 10 is a representation of the results of a MPE^Fe 11 footprint experiment showing 
binding of a seven base pair sequence by polyamides 4, 5, 6, and 3. 

Figure 1 1 is a representation of the sequence of the HIV-1 enhancer / promoter element 
showing the binding sites for the cellular transcription factors Ets-1, LEF-1, NF-kB and SP1 
along with a canonical TBP binding site, or "TATA element." 
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Figure 12 is a representation of the results of an experiment demonstrating inhibition of 
LEF-1 binding to the HIV-1 enhancer by polyamide 1. 

Figure 13 is a representation of the results of an experiment demonstrating binding to 
wild type and mutant forms of the HIV-1 enhancer by polyamide 1. 

Figure 14 is a representation of the results of an DNase footprint experiment 
demonstrating binding to the HIV-1 enhancer of the polyamide ImPy-P-ImPy-y-ImPy-p-ImPy- 
p-Dp, also called HIV-L 

Figure 15 is a representation of the inhibition of HIV-1 transcription by the polyamide 
ImPy-P-ImPy-y-ImPy-p-ImPy-P-Dp, also called HIV-1. 

Figure 16 is a schematic model of the binding of the polyamides ImPy-p-ImPy-y-ImPy- 
P-ImPy-p-Dp, also called and Imlm-P-Imlm-y-PyPy-p-PyPy-P-Dp, also called HIV-2 

to the DNA sequences adjacent to the HIV-1 TATA box. 

Figure 17 is a schematic model of the binding of polyamides designed for the 
recognition of TFIID (la - 1c) and Ets-1 /LEF-1 (2a-2b) binding sites. 

Figure 18 is a representation of the HIV-1 promoter and DNA-binding sites for 
polyamide hairpins designed to target the HIV-1 promoter. 

Figure 19 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and TBP. 

Figure 20 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and LEF-1 . 

Figure 21 is a representation of the results of a footprint experiment comparing the 
binding of polyamides, HIV-1 and CMV. 
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Figure 22 is a graphical representation of the results of a experimental treatment of 
human white blood cells infected with HIV. 

Figure 23 is a graphical representation of the results of a experimental treatment of 
human white blood cells infected with HIV. 

Figure 24 is a graphical representation of the results of a experiment demonstrating the 
specificity of polyamides. 

Figure 25 is a schematic representation of polyamide binding to the DNA sequence 
adjacent to the Ets-1 site. 

Figure 26 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and AN331 to the HIV-1 promoter region. 

Figure 27 is a representation of the results of experiments showing the specific Ets-1 
binding inhibition produced by polyamides. 

Figure 28 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and AN331 to the HIV-1 promoter region. 

Figure 29 is a schematic model of the binding of polyamides designed for the 
recognition of DNA sequence adjacent to the CMV TBP binding site and the repressor binding 
site. 

Figure 30 is a representation of the results of a footprint experiment comparing the 
binding of polyamides 1 and 5 and IE86 to the repressor region of the CMV genome. 

Figure 31 is a representation of the results of a footprint experiment comparing the 
binding of polyamides 3 and 4 and IE86 to the repressor region of the CMV genome. 

Figure 32 is a representation of the results of an experiment comparing the binding of 
polyamide 1 and IE86 on the transcription of CMV RNA. 
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Detailed Description of Preferred Embodiments 

Pyrrole-imidazole polyamides represent the only class of small molecules to date that 
5 can bind any predetermined DNA sequence. DNA recognition depends on side-by side amino 
acid pairings in the minor groove. A pairing of imidazole (Im) opposite pyrrole (Py) targets a 
G-C base pair. A Py/Py combination is degenerate and targets both T-A and A-T base pairs. 
However, Hp/Py is specific for a T-A base pair and Py/Hp is specific for an A-T base pair. 
The generality of these pairing rules has been demonstrated by targeting a variety of sequences 
10 5-13 base pairs in size and is supported directly by NMR structural studies. Eight-ring hairpin 
polyamides have affinities and specificities comparable to DNA-binding transcription factors. 

Solid phase methods for synthesizing the polyamides of the present invention have been 
described by Baird & Dervan (1 Am. Chem. Soc. y 118, 6141 (1996)). Alternatively, the 
15 polyamides can also be synthesized via solution phase methods as described by Weiss et al. (J. 
Am. Chem. toe, 79, 1266 (1957)). 

Many protein coding genes utilize both gene- and tissue-specific transcription factors as 
well as general transcription factors for transcription of mRNA by RNA polymerase II. The 

20 binding sites for these protein factors are found in numerous genes, whereas the sequences 
adjacent to these binding sites tend to be unique for each gene. Polyamide ligands can be 
designed which target both the sequences adjacent to the binding sites for these transcription 
factors as well as to the binding sequences for these factors. Polyamides that target these 
sequences will interfere with the binding of the protein factors to DNA and thereby inhibit 

25 transcription by RNA Polymerase IT. 

For example, the role of the nine zinc finger protein TFIIIA in the transcriptional 
regulation of the 5S RNA gene by RNA polymerase III has been extensively characterized. 
Zinc fingers 1-3, 5 and 7-9 bind tl?e internal control region (ICR) of the gene through base- 
30 specific interactions in the major groove. Fingers 4 and 6 are essential for high affinity DNA 
binding and have been proposed to bind in or across the minor groove (Figure 1). Well 
established methods exist for assessing in living cells the status of RNA polymerase III 
transcription complexes on the genes encoding the small 5S ribosomal RNA. 
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In another embodiment, polyamides have been designed and synthesized that recognize 
and bind the sequences immediately adjacent to the site at which the minor groove-binding 
protein TATA-box binding protein (TBP) binds to TATA DNA can be designed. DNA 
sequences adjacent to the TATA elements are gene-specific whereas TATA elements are found 
in many protein-coding genes. For example, a polyamide bound to the sequence adjacent to 
the HIV-1 TATA element has been shown to inhibit HIV-1 promoter-specific transcription by 
RNA polymerase II. A polyamide designed to selectively bind this site would be useful for 
treating diseases associated with HIV-1 infection. 

In a third embodiment, a polyamide recognizes and binds the sequence immediately 
adjacent to the binding site of lymphoid enhancer factor LEF-1 in.the HIV-1 enhancer. This 
polyamide inhibits the binding to LEF-1 protein to HIV-1 DNA. As above, a polyamide 
designed to selectively bind this site would be useful for treating diseases associated with HIV- 
1 infection. 

In another embodiment, a polyamide recognizes and binds to an identifed target 
sequence adjacent to the transcription factor protein binding site of a cellular gene. In one 
preferred embodiment, the cellular gene is a constitutively expressed gene under basal 
transcription control. An preferred cellular gene under basal transcription control is the gene 
encoding the 5S ribosomal subunit. 

In yet another preferred embodiment, the minor groove transcription factor protein of 
the cellular gene is TBP. Such preferred cellular genes include oncogenes such as LEF-1, Ets- 
1 and her-2/neu. Other such preferred cellular genes include genes encoding cytokines such as 
interleukins, including IL-2, IL-5 and IL-13, tumor necrosis factors, including TNF-alpha and 
TNF-beta, growth factors, including TGF-beta, and colony stimulating factors, including GM- 
CSF. 

Using the above described rules, a sequence-specific polyamide can be designed that 
selectively binds to an identified target site adjacent to the binding site of a minor groove 
transcription factor protein. As used herein, "adjacent" includes 1) polyamide binding sites 
wherein an end nucleotide base pair of the polyamide binding site is immediately contiguous 
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to an end nucleotide of the minor groove transcription factor protein binding site, 2) polyamide 
binding sites wherein one to five nucleotide base pairs of the polyamide binding site are shared 
with the binding sites of the minor groove transcription factor protein, and 3) polyamide 
binding sites wherein the polyamide binding site is separated from the minor groove 
5 transcription factor protein binding site by from one to four intervening nucleotide base pairs. 
The binding affinity of such a designed polyamide should be greater than the binding affinity 
of the native transcriptional element in order to inhibit transcription. The binding affinity can 
be ascertained by competitive inhibition experiments against a native transcription factor. 

10 Within this application, unless otherwise stated, definitions of the terms and illustration 

of the techniques of this application may be found in any of several well-known references 
such as: Sambrook, J., et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press (1989); Goeddel, D., ed., Gene Expression Technology, Methods in 
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in 

15 Deutshcer, M.P., ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, 
et al., PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, CA 
(1990); Freshney, R.L, Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed., Alan 
Liss, Inc. New York, NY (1987); Murray, E.J., ed., Gene Transfer and Expression Protocols, 
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University 

20 Press, New York (1 997). 

For the purposes of this application, a promoter is a regulatory sequence of DNA that is 
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a 
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding 

25 region, non-coding regions preceding ("leader") and following ("trailer") the coding region, as 
well as intervening non-coding sequences ("introns") between individual coding segments 
("exons"). Coding refers to the representation of amino acids, start and stop signals in a three 
base "triplet" code. Promoters are often upstream (" 5* to") the transcription initiation site of 
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are 

30 known, including sequences involved with the binding of transcription factors, including 
response elements that are the DNA sequences bound by inducible factors. Enhancers comprise 
yet another group of regulatory sequences of DNA that can increase the utilization of 
promoters, and can function in either orientation (5'-3' or 3'-5') and in any location (upstream 



WO 98/35702 



PCT/US98/02444 



- 18- 

or downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an cndogcneous ligand (e.g. a transcription factor) to the regulatory 
sequence increases transcription, thereby resulting in increased expression of the corresponding 
target gene. In such a case, interference with transcription by binding a polyamide to a 
regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also include or be adjacent to a regulatory sequence known in the art 
as a silencer. A silencer sequence generally has a negative regulatory effect on expression of 
the gene. In such a case, expression of a gene may be increased directly by using a polyamide 
to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a 
polyamide to block transcription of a factor to a silencer regulatory sequence. 

It is to be understood that the polyamides of this invention bind to double stranded 
DNA in a sequence specific manner. The function of a segment of DNA of a given sequence, 
such as 5'-TATAAA-3\ depends on its position relative to other functional regions in the 
DNA sequence. In this case, if the sequence 5'-TATAAA-3' on the sense strand of DNA is 
positioned about 30 base pairs upstream of the transcription start site, the sequence forms part 
of the promoter region (Lewin, Genes VI, pp. 831-835). On the other hand, if the sequence 5'- 
TATAAA-3' is downstream of the transcription start site in a coding region and in proper 
register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues 
(Lewin, Genes VI, pp. 213-215). 

While not being held to one hypothesis, it is believed that the binding of the polyamides 
of this invention modulate gene expression by altering the binding of DNA binding proteins, 
such as RNA polymerase, transcription factors, TBP, TFIIIA, the lymphoid enhancer factor 
protein LEF-1, and other minor groove transcription factor proteins. The effect on gene 
expression of polyamide binding to a segment of double stranded DNA is believed to be 
related to the function, e.g., promoter, of that segment of DNA. 

It is to be understood by one skilled in the art that the improved polyamides of the 
present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 
5,578,444 in Table I lists numerous pathogens in which are found medically significant target 



WO 98/35702 



PCT/US98/02444 



- 19- 

sequences for DNA-binding drugs and in Table II lists many non-infectious diseases that may 
be controlled at the level of DNA binding proteins. 

It is generally understood by those skilled in the art that the basic structure of DNA in a 
living cell includes both major and a minor groove. For the purposes of describing the present 
invention, the minor groove is the narrow groove of DNA as illustrated in common molecular 
biology references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

To affect gene expression in a cell, which may include causing an increase or a 
decrease in gene expression, a quantity of one or more polyamides effective to inhibit 
transcription is contacted with the cell and internalized by the cell. The cell may be contacted 
by the polyamide in vivo or in vitro. Effective transcription inhibiting extracellular 
concentrations of polyamides that can modulate gene expression range from about 10 
nanomolar to about 1 micromolar. Gottesfeld, J.M., et ai, Nature 387 202-205 (1997). To 
determine effective amounts and concentrations of polyamides in vitro, a suitable number of 
cells is plated on tissue culture plates and various quantities of one or more polyamide are 
added to separate wells. Gene expression following exposure to a polyamide can be monitored 
in the cells or in the medium by detecting the amount of the protein gene product present as 
determined by various techniques utilizing specific antibodies, including ELISA and western 
blot. Alternatively, gene expression following exposure to a polyamide can be monitored by 
detecting the amount of messenger RNA present as determined by various techniques, 
including northern blot and RT-PCR. 

Similarly, to determine effective amounts and concentrations of polyamides for in vivo 
administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal 
fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting 
the amount of the protein gene product present as determined by various techniques utilizing 
specific antibodies, including ELISA and western blot. Alternatively, gene expression 
following exposure to a polyamide can be monitored by the detecting the amount of messenger 
RNA present as determined by various techniques, including northern blot and RT-PCR. 
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The polyamides of this invention may be formulated into diagnostic and therapeutic 
compositions for in vivo or in vitro use. Representative methods of formulation may be found 
in Remington: The Science and Practice of Pharmacy, 19th ed., Mack Publishing Co., Easton, 
PA (1995). 

For in vivo use, the polyamides may be incorporated into a physiologically acceptable 
pharmaceutical composition that is administered to a patient in need of treatment or an animal 
for medical or research purposes. The polyamide composition comprises pharmaceutical^ 
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in 
solid, liquid, gel, or aerosol form. The polyamide composition of the present invention maybe 
administered in various dosage forms orally, parentally, by inhalation spray, rectally, or 
topically. The term parenteral as used herein includes, subcutaneous, intravenous, 
intramuscular, intrasternal, infusion techniques or intraperitoneally. 

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
compound, the intended use, the nature and severity of the condition being treated or 
diagnosed, the age, weight, gender, physical condition and mental acuity of the intended 
recipient as well as the route of administration. Such considerations are within the purview of 
the skilled artisan. Thus, the dosage regimen may vary widely, but can be determined routinely 
using standard methods. 

Polyamides of the present invention are also useful for detecting the presence of double 
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample 
containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, 
thereby isolating DNA comprising a desired sequence. Alternatively, polyamides linked to a 
suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be 
contacted by a sample containing double stranded DNA. 

The design of Afunctional sequence specific DNA binding molecules requires the 
integration of two separate entities: recognition and functional activity. Polyamides that 
specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linked to a functional molecule, providing the corresponding 
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bifimctional conjugates useful in molecular biology, genomic sequencing, and human 
medicine. Polyamides of this invention can be conjugated to a variety of functional 
molecules, which can be independently chosen from but is not limited to arylboronic acids, 
biotins, polyhistidines comprised from about 2 to 8 amino acids, haptens to which an antibody 

5 binds, solid phase supports, oligodeoxynucleotides, N-ethylnitrosourea, fluorescein, 
bromoacetamide, iodoacetamide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, 
texas red, anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, ethyl red, 
4-(psoraen-8-yloxy)-butyrate, tartaric acid, (-f)-a-tocopheral, psoralen, EDTA, methidium, 
acridine, Ni(II)»Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such 

10 bifunctional polyamides are useful for DNA affinity capture, covalent DNA modification, 
oxidative DNA cleavage, and DNA photocleavage. Such bifunctional polyamides are useful 
for DNA detection by providing a polyamide linked to a detectable label. Detailed 
instructions for synthesis of such bifunctional polyamides can be found in copending U.S. 
provisional application 60/043,444, the teachings of which are incorporated by reference. 

15 

DNA complexed to a labeled polyamide can then be determined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
a polyamide linked to biotin can be detected by a streptavidin / alkaline phosphatase system. 

20 The present invention also describes a diagnostic system, preferably in kit form, for 

assaying for the presence of the double stranded DNA sequence bound by the polyamide of 
this invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body 
fluid samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, 
and preferably the amount, of the double stranded DNA sequence bound by the polyamide in 

25 the sample according to the diagnostic methods described herein. 

The diagnostic system includes, in an amount sufficient to perform at least one assay, a 
specific polyamide as a separately packaged reagent. Instructions for use of the packaged 
reagent(s) are also typically included. As used herein, the term "package" refers to a 

30 solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
milligram quantities of a contemplated polyamide or it can be a microtiter plate well to which 
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micrograin quantities of a contemplated polyamide have been operatively affixed, i.e., linked 
so as to be capable of being bound by the target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent concentration or at least one assay method 
parameter such as the relative amounts of reagent and sample to be admixed, maintenance time 
periods for reagent or sample admixtures, temperature, buffer conditions and the like. A 
diagnostic system of the present invention preferably also includes a detectable label and a 
detecting or indicating means capable of signaling the binding of the contemplated polyamide 
of the present invention to the target DNA sequence. As noted above, numerous detectable 
labels, such as biotin, and detecting or indicating means, such as enzyme-linked (direct or 
indirect) streptavidin, are well known in the art. 

Example 1: 
Transcription inhibition in vitro and in vivo. 

A high speed cytosolic extract from unfertilized Xenopus eggs was prepared as 
described in Hartl, et al., J. Cell Biol 120, 613-624 (1993) . DNA templates for transcription 
were the somatic-type 5S RNA gene contained in plasmid pXlsll (50 ng per reaction; 
Peterson et al, Cell 20, 131, 1980) and the tyrD tRNA gene contained in plasmid pTyrD (100 
ng per reaction; Stutz, et al, Genes Dev. 3, 1190, 1989) both from Xenopus laevis. 
Transcription reactions (20 jiL final volume) contained the following components: 2.5 
extract, 9 ng (12 nM) of TFIIIA isolated from immature oocytes (Smith et al, Cell 37, 645, 
1984), 0.6 mM ATP, UTP, CTP, 0.02 mM GTP and 10 ^Ci of [a- 32 P] GTP, and the final 
buffer components 12 mM HEPES (pH 7.5), 60 mM KCI, 6 mM MgCl 2 , 25 nM ZnCl 2 , and 8% 
(v/v) glycerol. Plasmid DNAs were pre-incubated with polyamides in the same buffer prior to 
adding TFIIIA and other reaction components. RNA was purified and analyzed on a 
denaturing 6% polyacrylamide gel. A Molecular Dynamics Phosphorimager equipped with 
ImageQuant software was used to quantify the effects of the polyamides on the relative 
transcription efficiencies of the 5S and TRNA genes. 

Fibroblasts from a Xenopus kidney derived cell line (kindly provided by Dr. P. Labhart, 
The Scripps Research Institute) were grown at ambient temperature in 25 cm 2 culture flasks in 
Dulbecco's modified Eagle medium containing 10% (v/v) fetal calf serum. Cells were 
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passaged for a minimum of three days prior to the addition of polyamide to the culture 
medium. Incubations were continued for various times and nuclei were prepared by hypotonic 
lysis and used as templates for transcription as described. DNA content was determined by 
measuring the absorbance of an aliquot of the isolated nuclei in 1 % (w/v) sodium dodecyl 
5 sulfate (using an extinction coefficient at 260 nM of 1 AU = 50 ng/mL DNA). The buffer 
components and labeled and unlabeled nucleoside triphosphates were as for the plasmid 
transcription reactions. Reactions were supplemented with 2 of RNA polymerase III (at 
approximately 50 [xg/mL) isolated from Xenopus oocytes. 

10 Footprinting experiments reveal 90% inhibition of TFIIIA binding in the presence of 5 

nM polyamide. After incubation with 60 nM polyamide in a cell-free extract derived from 
Xenopus oocytes, the transcriptional activity of a 5S gene is inhibited by >80%. When 100 nM 
polyamide is supplied in culture medium containing Xenopus kidney cells, transcription 
complexes on the 5S RNA genes are selectively disrupted. These results demonstrate that 

15 pyrrole-imidazole polyamides are cell permeable and can inhibit the transcription of a specific 
gene in living cells. 

Polyamides were synthesized by solid phase methods as described in Baird, E. E. & 
Dervan, P. B. J. Am. Chem. Soc. 118, 6141-6146 (1996). The identity and purity of the 
20 polyamides was verified by l U NMR, matrix-assisted laser desorption/ionization time of flight 
mass spectrometry (MALDI-TOF-MS), and analytical HPLC. MALDI-TOF-MS: 1, 1223.4 
(1223.3 calculated for M+H); 2, 1222.3 (1222.3 calculated for M+H); 3, 1223.1 (1223.3 
calculated for M+H). 

25 

The eight-ring polyamide (labelled 1 in Figure lb) having the sequence composition 
ImPyPyPy-y-ImPyPyPy-P-Dp was synthesized by solid phase methods and shown to bind the 
six base pair site S'-AGTACT^' at subnanomolar concentration (Figure 1). This sequence is 
within the binding site for zinc finger 4 of TFIIIA. Quantitative DNAse I footprint titration 
30 experiments reveal that polyamide 1 of Figure lb selectively binds the six base pair target 
sequence with a dissociation constant, Kd = 0.03 nM, a higher affinity than TFIIIA for its 50 
base pair site (K d ~ 1 nM). For controls, mismatch eight-ring polyamides ImPyPyPy-y- 
PyPyPyPy-(3-Dp (labelled 2 in Figure lb) and ImPylmPy-y-PyPyPyPy-p-Dp (labelled 3 in 
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Figure lb) were prepared which have 100-fold (K d =2.0 nM) and 1000-fold (Kd =33 nM) lower 
affinities, respectively, for the 5'-AGTACT-3'site. The models of the binding of the 
polyamides to the identified target sequences of double stranded DNA are illustrated in Figure 
lc. 

The left side of Figure la shows a schematic model for the interaction of the nine 
zinc finger protein TFIIIA with the 5S ribosomal RNA gene internal control region (ICR). 
The middle section of Figure la shows the sequence of the ICR recognized by finger 4 in 
the minor groove. The six base pair region targeted by the designed "hairpin" polyamide is 
enclosed in a rectangle. The right section of Figure la shows the expected complex of 
ImPyPyPy-Y-ImPyPyPy-P-Dp (polyamide 1) with its identified target DNA sequence, 5'- 
AGTACT-3'. Circles with dots represent lone electron pairs on N3 of purines and 02 of 
pyrimidines. Circles containing an H represent the N2 hydrogen of guanine. Putative 
hydrogen bonds are illustrated by dashed lines. 

Figure lb shows the structures of polyamides ImPyPyPy-y-ImPyPyPy-p-Dp (1), 
ImPyPyPy-Y-PyPyPyPy-P-Dp ( 2 )< and ImPylmPy-Y-PyPyPyPy-p-Dp (3). The models of 
binding of these polyamides to the target duplex DNA sequence are shown in Figure lc. 
The filled and empty circles represent imidazole and pyrrole rings, respectively, the curved 
line represents Y-aminobutyric acid (y), and the diamond represents p-alanine. Hydrogen 
bond mismatches such as G-Py and A-Im are highlighted. 

The effect of polyamide labelled 1 in Figure , lb flmPyPyPy-Y-ImPyPyPy-p-Dp) on 
TFIIIA binding to a restriction fragment isolated from a 5S RNA gene-containing plasmid was 
examined. Zfl-3, a recombinant TFIIIA analog missing fingers 4-9, binds in the major groove 
of the C-block promoter element (see Figure la). DNase I footprinting demonstrates that zfi-3 
and polyamide 1 can co-occupy the same DNA molecule. When 5 nM polyamide 1 was 
preincubated with the same DNA target, the binding of nine finger TFIIIA was inhibited by 
>90% (Figure 2). Figure 2 shows the result of DNase I footprinting analysis of the binding of 
polyamide 1 and TFIIIA to the 5S RNA gene ICR. 5'-end-labeled restriction fragments were 
derived from the 5S RNA gene by standard methods and footprinting reactions were as 
described in Clemens, K.R., et al., J. Mol. Biol. 244 23-35 (1994). Lane 1 shows a DMS G- 
specific sequencing ladder, lane 2, protein-free DNA, lanes 3-5, digestion products obtained in 
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the presence of 0.5 nM, I nM and 2 nM TFIIIA, respectively, lane 6, digestion products 
obtained in the presence of 5 nM polyamide 1. Lanes 7-9 show the digestion products of 
reactions in which the DNA was incubated with 1 for 30 min prior to the addition of TFIIIA 
(at the same concentrations as in lanes 3-5, respectively) and incubated for an additional 30 
5 min prior to DNase treatment. The differential inhibition of zfl-3 and full-length TFIIIA 
provides evidence that finger 4 interacts with or is placed in the minor groove. Polyamide 1 
does not inhibit TFIIIA binding to 5S RNA. 



Transcription of the 5S RNA gene in an in vitro system was monitored in the presence 
10 of increasing concentrations (10-60 nM) of polyamide 1. The results are shown in Figure 3. In 
these experiments, polyamide 1 was added to a 5S RNA gene containing plasmid prior to the 
addition of exogenous TFIIIA (12 nM) and a crude extract derived from unfertilized Xenopus 
eggs. As a control, a tyrosine tRNA gene was included on a separate plasmid in these 
reactions. The tRNA gene has an upstream binding site for polyamide 1, but lacks a predicted 
15 protein-DNA interaction. Both genes are actively transcribed in this system, either individually 
(lanes 1 and 15) or in mixed template reactions (lane 2). Addition of 60 nM polyamide 1 
inhibits 5S gene transcription by >80% (lane 6). Only a small degree of non-specific inhibition 
of tRNA transcription is observed at the concentrations of polyamide 1 required for efficient 5S 
RNA inhibition. The targeted 5S RNA gene is inhibited approximately 10-fold more 
20 effectively than the control tRNA gene. 



In these experiments the DNA templates were incubated with polyamide for 30 min 
prior to the addition of TFIIIA, a cytoplasmic extract prepared from unfertilized Xenopus eggs, 
and labeled and unlabeled nucleoside triphosphates. Reactions contained either the somatic- 

25 type 5S RNA gene (Figure 3a, lane 1), a tRNA*' 0 gene (lane 15), or a mixture of both genes 
(lanes 2-14). Reactions contained the following final concentrations of the indicated 
polyamide: 0 nM, lanes 1, 2, 15; 10 nM, lanes 3, 7, 11; 20 nM, lanes 4, 8, 12; 40 nM, lanes 5, 
9, 13; 60 nM, lanes 6, 10, 14. Transcription reactions were stopped after 1 hour at ambient 
room temperature and the RNA products were analyzed on a denaturing polyacrylamide gel. 

30 The positions of 5S RNA and the tRNA primary and processed transcripts are indicated at the 
right of the autoradiogram. In Figure 3b the results are represented graphically, with inhibition 
results expressed as the ratio of 5S RNA to tRNA transcription relative to the ratio obtained in 
the absence of polyamide. The error bars represent estimated standard deviations. 
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Mismatch polyamides ImPyPyPy-y-PyPyPyPy-p-Dp (2) and ImPylmPy-y-PyPyPyPy- 
(J-Dp (3) did not inliibit 5S RNA transcription at concentrations up to 60 nM. If the TFIIIA- 
DNA complex is first allowed to form, 30 nM polyamide 1 added, and the mixture incubated 
for 90 minutes prior to adding egg extract, efficient inhibition (80%) of 5S RNA transcription 
is also observed. Shorter incubation times result in less inhibition. The required incubation 
time of 90 minutes is similar to the measured half-life of the TFIIIA-DNA complex and 
supports that polyamide 1 forms a more stable complex with DNA than does TFIIIA. 

The effect of the polyamides on 5S gene transcription in vivo was determined (Figure 
4). Xenopus kidney-derived fibroblasts were grown in the presence of increasing 
concentrations of polyamide 1 in the culture medium for various times. Concentrations of 
polyamide up to 1 jaM were not toxic, as measured by cell density, if growth was limited to 
less than 72 hours. Nuclei were prepared from cells by hypotonic lysis and equivalent amounts 
of the isolated nuclei from control and treated cells were used as templates for transcription 
with exogenous RNA polymerase III and labeled and unlabeled nucleoside triphosphates. This 
experiment monitors the occupancy of class III genes with active transcription complexes. 5S 
RNA transcription can easily be assessed since the repetitive 5S genes give rise to a prominent 
band on a denaturing polyacrylamide gel. An autoradiogram was taken of the gel and the 
following observations made based on the observed autoradiogram. 

Nuclei were prepared from Xenopus kidney-derived fibroblasts that were grown in 
culture in the presence or absence of polyamides. Polyamides were included in culture 
medium (in 2.5 mL of media per 25 cm 2 flask) for 24 hours prior to harvesting cells and 
isolation of nuclei. Equal amounts of nuclei (containing 5 iig of DNA) were incubated for 2 
hours in 20 |liL reactions containing Xenopus RNA polymerase III and labeled and unlabeled 
nucleoside triphosphates. RNA was isolated from these reactions and analyzed on a denaturing 
polyacrylamide gel The results are shown in Figure 4a in which lane 1 is a 5S RNA marker; 
lane 12 is a tRNA lyrD gene marker; lane 2, cells not exposed to polyamide; lanes 3-11, cells 
exposed to the indicated polyamides 1, 2, or 3 at the following concentrations: 100 nM, lanes 3, 
6, 9; 300 nM, lanes 4, 7, 10; 1 |iM, lanes 5, 8, 1 1. Figure 4b is a graphic representation of the 
effects of polyamides 1, 2 and 3 on 5S RNA and tRNA transcription expressed as the ratio of 
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5S RNA to tRNA gene transcription relative to the ratio obtained in the absence of polyamide. 
Error bars represent estimated standard deviations. 

Concentrations of polyamide 1 as low as 100 nM have a pronounced and selective 
effect on 5S transcription. At higher polyamide concentration, a general decrease in the 
transcriptional activity of the nuclei is observed. However, phosphorimage analysis of this 
experiment reveals that at each concentration tested, the effects of the polyamide are much 
greater on 5S RNA transection than on tRNA transcription. Having established that nearly 
maximal inhibition of 5S transcription is achieved with 1 |iM polyamide 1, nuclear 
transcription was monitored after various times of cell growth in the presence of the 
polyamide. No inhibition is observed for zero time incubation with polyamide 1 at 1 |*M 
concentration, indicating that disruption of transcription complexes does not occur during or 
after the isolation or work-up of cell nuclei. Statistically equivalent levels of 5S transcription 
were observed when the cells were exposed to polyamide 1 for 24, 48 or 72 hours. 

These nuclear transcription experiments indicate that polyamide 1 is able to enter cells, 
transit to the nucleus and disrupt transcription complexes on the chromosomal 5S RNA genes. 
To rule out the possibility that the observed inhibition is due to some non-specific toxicity of 
the polyamide rather than to direct binding to the 5S RNA gene, the effects of mismatch 
polyamides 2 and 3 in the nuclear transcription assay were monitored. Only a small effect on 
5S RNA synthesis relative to tRNA synthesis is observed with 1 |iM of the mismatch 
polyamides 2 or 3 in the culture medium for 24 hours. This result indicates that the general 
inhibition of transcription observed with high concentrations of polyamide 1 may be a 
secondary effect of the inhibition of 5S RNA synthesis in vivo, rather than the result of non- 
specific polyamide interactions. Polyamide 2 affects a small enhancement of 5S RNA 
transcription in vitro and in vivo, indicating that polyamides may be able to upregulate 
transcription in certain cases. 



WO 98/35702 



PCTAJS98/02444 



■ 28- 



Example 2: 

Design and Synthesis of Optimized Polyamides 
Targeted to HIV-1 Promoter Sequences 

The experiments of Example 1 showed that an eight-ring "4-y-4 motif polyamide 
having the sequence composition ImPyPyPy-y-ImPyPyPy-p-Dp binds specifically to the six 
base pair site 5'-AGTACT-3 ' . It has also been shown recently that the 1 0-ring, "5-y-5 motif 
polyamide ImPyPyPyPy-y-ImPyPyPyPy-P-Dp specifically binds a seven base pair 5'- 
WGWWWCW-3' target sequence with subnanomolar affinity in a hairpin conformation 
(Trauger, J.W., el al, J. Am. Chan. SocM* 6160-6166, 1996). Applying the polyamide 
pairing rules to the 5-y-5 molecular template suggested that the 10-ring hairpin polyamide 
ImPyPylmPy-y-IniPyPylmPy-P-Dp would bind the HIV-1 target sequences 5'-WGCWGCW- 
3'. However, as reported below, polyamide polyamide ImPyPylmPy-y-ImPyPylmPy-p-Dp 
specifically binds the target site 5'-TGCTGCA-3', but with relatively low affinity, 
necessitating development of a second-generation polyamide. 

Design of Second Generation Polyamide 4. The simple amino acid P-alanine has 
been employed effectively as a single base pair-spanning linker in many different polyamides. 
In particular, in many cases in which a polyamide binds with relatively low affinity due to an 
apparent register mismatch between polyamide residues and their target DNA bases, 
substitution of one or more pyrrole residues with the flexible spacer p-alanine results in marked 
increases in binding affinity. These results suggested that the 8-ring polyamide ImPy-p-ImPy- 
y-ImPy-p-ImPy-p-Dp (4), which differs from 1 only by the replacement of two pyrrole residues 
with p-alanines, would specifically bind the HIV-1 target sequences 5'-WGCWGCW-3' with 
high affinity using a novel 2-p-2-y-2-p-2 motif. Equilibrium association constants were 
determined by quantitative DNase I footprint titration experiments of the formally matched 
(according to the polyamide pairing rules) polyamides 1 and 4 and the formally mismatched 
polyamides 2, 3, 5, and 6 for the HIV-1 target sequence 5'-TGCTGCA-3' (Figures 5b and 6). 
To further demonstrate the general usefulness of the 2-p-2-y-2-p-2 motif, the binding of 2 and 
5, and 3 and 6 to their respective formal match sequences 5'-TGGTGGA-3' and 5'- 
TGTTACA-3' was examined. MPE-Fe 11 footprinting and affinity cleavage studies were 
preformed for a subset of these compounds. Finally, the equilibrium association constants of 
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polyamide 4 for the HIV-1 target sequence 5'-TGCTGCA-3' were determined at either 24 °C 
or 37 °C, and using either standard polyamide assay solution conditions or approximate 
intracellular solution conditions. 

NMR spectra were recorded on a GE 300 instrument operating at 300MHz (*H). 
Spectra were recorded in DMSO-tftf with chemical shifts reported in parts per million relative 
to residual DMSO-rfj. Polyamide concentrations were determined by UV absorbance 
measurements made on a Hewlett-Packard Model 8452A diode array spectrophotometer. 
Matrix-assisted, laser desorption/ionization time of flight mass spectrometry was carried out at 
the Protein and Peptide Microanalytical Facility at the California Institute of Technology. 
Analytical HPLC was performed on a HP 1090M analytical HPLC using a Rainen CI 8, 
Microsorb MV, 5 m, 300 x 4.6 mm reversed phase column in 0.1% (wt/v) TFA with 
acetonitrile as eluent and a flow rate of 1.0 ml/min, gradient elution 1.25% acetonitrile/min. 
Preparative HPLC was carried out on a Beckman instrument using a Waters DeltaPak 25 x 100 
mm, 100 m Ci8 column in 0.1% (wt/v) TFA, gradient elution 0.25%/min. CH3CN. K coli 
XL-1 Blue competent cells were obtained from Stratagene. Restriction endonucleases were 
purchased from Boeringher-Mannheim or New England Biolabs. Plasmid isolation kits were 
obtained from Promega. Plasmid sequencing was carried at the Sequence Analysis Facility at 
the California Institute of Technology. Sequenase (version 2.0) was obtained from United 

32 

States Biochemical, and DNase I (FPLCpure) was obtained from Pharmacia, [a- P]- 

Thymidine-S'-triphosphate (>3000 Cj/mmol), [a- 32 P]-deoxyadenosine-5'-triphosphate (> 6000 
32 

Q/mmol), and [a- P]-adenosine-5'-triphosphate were purchased from Du Pont/NEN. Water 
was obtained from a Millipore Milli-Q water purification system. 

Synthesis. Polyamides were prepared by stepwise solid-phase synthesis using BOC- 
protected monomers as described by Baird, E.E.; Dervan, P.B. J. Am. Chem. Soc. 1996, 118, 
6141-6146. 

Plasmids. Plasmid pJT2B2 was prepared by hybridizing the complementary 
oligonucleotides, 

5-CCGGCTTAAG TTCGTGGGCC ATGCTGCATT CGTGGGCCAT GGTGGATTCG 
TGGGCCATGT TACATTCG-3' and 
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5-TCGACGAATG TAACATGGCC CACGAATCCA CCATGGCCCA CGAATGCAGC - 
ATGGCCCACG AACTTAAG-3', and ligating the resulting duplexes to the large pUC19 Ava 
USal I restriction fragment. 

The plasmid was transformed into E.colL and plasmid DNA isolated using standard 

32 

methods, and the sequence of the insert confirmed by sequencing. The 3'- P end-labeled Eco 

RI/Pvw II restriction fragment was prepared by simultaneous digestion with Eco RI and Pvu II 

32 

and 3'-fill-in using Sequenase, [a- 32 P]-deoxyadenosine-5'-triphosphate, and [a- P]- 
thymidine-5'-triphosphate. The 282 bp fragment was isolated by nondenaturing gel 
electrophoresis. The 5'- 32 P-end-labeled EcoRVPvuW fragment was prepared using standard 
methods. See Sambrook, J.; Fritsch, E.F.; Maniatis, T. Molecular Cloning; Cold Spring 
Harbor Laboratory: Cold Spring Harbor, NY, 1989. 

A-specific chemical sequencing was carried out as described by Iverson, B.L.; Dervan, 
P.B. Nucleic Acids Res. 1987, 15, 7823-7830. Standard methods as described by Sambrook, et 
al. (1989) were used for all DNA manipulations. 

Quantitative DNase I Footprints g. Equilibrium association constants for polyamide- 
DNA complexes were determined by quantitative DNase 1 footprint titration experiments. 
Reactions were carried out in a total volume of 400 uL in the absence of carrier DNA. A 
polyamide stock solution or H2O (for reference lanes) was added to a solution containing 3'- 
end-labeled restriction fragment (15,000 cpm) affording final solution conditions of either 10 
mM Tris-HCl, 10 mM KC1, 10 mM MgCl 2 , 5 mM CaCl 2 , pH 7.0. at 24 °C, or 10 mM 
HEPES»HC1, 140 mM KC1, 10 mM NaCl, 1 mM MgCl2, 1 mM spermine, pH 7.2 as indicated 
in Tables 1 and 2. The solutions were allowed to equilibrate at 24 °C or 37 °C as indicated in 
Tables 1 and 2 for 12-16 hr. Cleavage reactions were initiated by the addition of 10 U-L of a 
DNase I stock solution (at the appropriate concentration to give -55% intact DNA) containing 
1 mM dithiothreitol, allowed to proceed for 7 min. at 24 °C, or 3.5 min. at 37 °C, and stopped 
by the addition of 50 uL of a solution containing 2.25 M NaCl, 150 mM EDTA, 0.6 mg/mL 
glycogen, and 30 uM base-pair calf thymus DNA, and precipitated with 1 mL ethanol. 
Reactions were then resuspended in lx TBE/80% formamide, heated at 85 °C for 10 min, 
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placed on ice, and the reaction products separated by electrophoresis on an 8% polyacrylamide 
gel (5% cross-link, 7 M urea) in lx TBE at 2000 V. Gels were dried, exposed to a storage 
phosphor screen (Molecular Dynamics), and imaged using a Molecular Dynamics 400S 
Phosphorlmager. Data was obtained from the imaged gels by quantitation using ImageQuant 
software (Molecular Dynamics). Background-corrected volume integration of rectangles 
encompassing the footprint sites and a reference site at which DNase I reactivity was invariant 
across the titration generated values for the site intensities (I s it e ) and the reference intensity (L 

ref)- ^he a PP arent fractional occupancy (0app) of the sites was then calculated using the 

equation: 

<W - i - ■""'j". (i) 

I site / I ref 

where I° s j te ancl I°ref are site and re f ercnce intensities, respectively, from a control lane to 
which no polyamide was added. The ([L] tot , 6 a pp) data points were fit to a general Hill 
equation (eq. 2) by minimizing the difference between 9 a pp and 9fit: 



9fit = Gmin + (Omax - Gmin ) — (2) 

1 + Ka n [L] n tot 

where [L] tot is the total polyamide concentration, K a is the equilibrium association constant, 
and 0 m in and 9 m ax are the experimentally determined site saturation values when the site is 
unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting 
procedure with K a , 9 max , and 9 m i n as the adjustable parameters, and with a fixed value for n. 
For hairpin poly amides 1-6, binding isotherms were adequately fit by Langmuir isotherms (eq. 
2, n=l), consistent with formation of a 1:1 polyamide-DNA complexes. For unlinked 
polyamides 7-9 binding to their match sequences, binding isotherms were adequately fit by a 
cooperative isotherm (eq. 2, n=2) consistent with cooperative dimeric binding. Reported 
association constants are the average value obtained from at least three independent 
footprinting experiments. 

MPE-Fe 11 Footprinting and Affinity Cleavage Experiments* Exact binding sites 

were determined by MPE*Fe TI footprinting. Binding orientation was probed using affinity 
cleavage experiments. All reactions were carried out in a total volume of 400 |xL. A stock 

solution of polyamide (for MPErFe 11 footprinting), polyamide-EDTA (for affinity cleavage), 
or FtoO (for reference lanes) was added to a solution containing 3'- or 5' -end-labeled 
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restriction fragment (15,000 cpm) affording final solution conditions of 20 mM HEPES, 200 
mM NaCl, 50 ^ig/mL glycogen, and pH 7.3, and the solution allowed to equilibrate for 12 hr at 

24 °C. Next, for MPE'Fe 11 footprinting reactions, MPE'Fe 11 was added to a final 
concentration of 0.5 and the solution allowed to equilibrate for 10 min. (a 5 nM 
MPE-FE 11 stock solution was prepared by mixing equal volumes of 10 jaM MPE and freshly 
prepared 10 yM Fe(NH4)2(S04)2), and for affinity cleavage reactions, freshly prepared 
Fe(NH4)2(S04)? was added to a final concentration of 1 |iM and the solution allowed to 

equilibrate for 30 min. For both MPE'Fe 11 footprinting and affinity cleavage, cleavage was 
initiated by the addition of dithiothreitol to a final concentration of 5 mM and allowed to 
proceed for 30 min at 24 °C, then stopped by adding 1 mL ethanol. Next, 10 >iL of a solution 
containing calf thymus DNA (140 \xM base-pair) (Pharmacia) and glycogen (Boehringer- 
Mannheim) (2.8 mg/mL) was added, and the DNA precipitated. The reactions were 
resuspended in IX TBE/80% formamide, heated at 85 °C for 10 min, placed on ice, and the 
reaction products separated by electrophoresis on an 8% polyacrylamide gel (5% cross-link, 7 
M urea) in IX TBE at 2000 V. Gels were dried and imaged as described above. Relative 
cleavage intensities were determined by volume integration of individual cleavage bands using 
ImageQuant software (Molecular Dynamics). 



Results and Discussion 



Synthesis. 

Polyamides were synthesized by solid phase methods as described in Baird, E.E.; 
Dervan, P.B. J. Am, Chem. Soc, 118, 6141-6146 (1996). 

Binding Affinities. 

Quantitative DNase I footprint titration experiments (10 mM Tris'HCl, 10 mM KC1, 10 
mM MgCl2, 5 mM CaCl2, pH 7.0, 24 °C) were carried out on the 282 bp, 3'- 32 P end-labeled 
pJT2B2 EcoRUPvuU restriction fragment which contains the three target sites S'-TGCTGCA- 
3' (the HIV-1 target sequence), 5'-TGGTGGA-3\ and 5'-TGTTACA-3' (Figure 7). These 
experiments reveal that the 5-y-5 motif polyamide 1 binds the HIV-1 target sequence 5'- 

7 -1 

TGCTGCA-3' with a relatively low equilibrium association constant, K a = 8 x 10 M , while 
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no binding of the 5-y-5 polyamide 2 to its target site 5'-TGGTGGA-3' is observed (K a < 5 x 
7 -1 

10 M ). In contrast, 5-y-5 polyamide 3 binds its target site 5'-TGTTACA-3' with very high 
affinity (K a = 5x 10 10 M" 1 ). (Figure 8, Table 1). 

5 Quantitative DNase I footprinting experiments reveal that the three 2-p-2-y-2~P-2 motif 

polyamides 4, 5 and 6 selectively target their respective match sequences 5'-TGCTGCA-3' 
(the HIV-1 target sequence), 5'-TGGTGGA-3\ and 5'-TGTTACA-3' with equilibrium 

association constants greater than 1 xlO 9 M~ 1 (Figure 8, Table 1), demonstrating that the 2-p- 
2-y-2-P-2 motif allows specific targeting of a range of 7 base pair sequences with 
10 subnanomolar affinity, and is generally useful for all 5'-WNNWNNW-3' sequences. 
Polyamide 4 specifically binds the HIV-1 target sequence 5'-TGCTGCA-3' with 

subnanomolar affinity (Ka = 2 x 10 10 M" 1 ), and thus represents a solution to the present 
polyamide design problem. 



Table 1. Equilibrium association constants (M'Y 



Polyamidc 5' aTGCTGCAtO' 5 -aTGCTCGAt-}' 5-aTGTrACAi-3' 



5-y-5motif: 

MmPyPylmPyvlmPyPylmPy-p-Dp 8.3 x I0 7 (I.S) < I x IO 7 

2 ImlmPylmIm-Y-PyPyPyPyPy-3-Dp < 5 x I0 7 < 5 x ID 1 

3 ImPyPyPyPy-y-lmPyPyPyPy-P-Dp |.2 x 10 9 (0.4) < 5 x 10 s 



2-0-2 ^2-0.2 motif: 

4 lmPy-3-ImPy-Y-lmPy.f5-ImPy-()-Dp h \,$ x io 10 (0-3) < 5 x 10* < 5 x 10* 

5 Imlm-P-lmlm-Y-PyPy-p-PyPy-P-Dp 1.7 x 10 s (0.3) 7.6 x 10* (1.0) < I x 10 s 

6 imPy-P-PyPy-y-ImPy-p-PyPy-P-Dp < I x IO 3 < I x 10 s 2.6 x 10 9 (l.O) 



Unlinked dimer motif: 

7 ImPyPyPyPy-fJ-Dp |.| x K) 7 (0.2) < I x IO 7 3.9 .x I0 S ,(0.6) 

8 lmPy-&-lmPy-p-Dp 3.0 x 10* (0.2) < 2 x IO 5 1.5x10* (0.3) 

9 ImPy-p-PyPy-P-Dp < 5 x IO 5 < 5 x 10 S 2.3 x 10 6 (0. 1 ) 

'Values reported- a/c the mean values from at least ihrec DNasc I footprint titraiion experiments. The standard 
deviation for each value is indicated in parentheses. The assays were earned out at 24 "C. pH 7.0. in the presence of 10 
m.M Tris»HCl. 10 mM KCI. 10 mM MgCK. and 5 mM CaCk Nucleotides flanking polyamide binding sites arc in 
lowercase type. Association constants corresponding to formally matched complexes are in bold type. ' 

This polyamide binds the site S'-tgUacaiTCGTCGAcO" adjacent to the 5*-TGTTACA-3' site with an equilibrium 
association constant of 2.1 x 10* M" 1 (0.1). 



15 

Failure of Polyamide 1 . 

The failure of polyamide 1 to bind the HIV-1 target sequence with high affinity is 
consistent with previous results. For example, the 4-y-4 motif polyamides ImPyPyPy-y- 
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ImPyPyPy-p-Dp, ImPyPyPy-y-PyPyPyPy-P-Dp and ImlmPyPy-y-ImlmPyPy-P-Dp bind 

respective 6 base pair target sequences with subnanomolar affinity (Ka > 10 10 M *)» while 
IniPylmPy-y-ImPylmPy-p-Dp and Imlmlmlm-y-PyPyPyPy-p-Dp bind respective target 

sequences with substantially lower affinity (Ka < 10 8 M" 1 )- Based on these and additional 
results, the common feature of hairpin polyamides that bind with high affinity appears to be the 
placement of imidazole residues only at the first and second positions of polyamide subunits, 
counting from the N-terminal end, although in some cases substitution at the third position is 
tolerated. Imidazoles placed beyond the second position of a polyamide subunit are not 
positioned optimally for specific ligand-DNA contacts. 

The 5-Y-5 polyamides 1, 2, and 3 are analogs of 4, 5, and 6, respectively, in which the 
P-alanine residues have been changed to pyrrole residues (Figure 6). Polyamides 1 and 2, 
which have imidazole residues beyond the second position of a polyamide subunit, have 
equilibrium association constants for their match sites more than 100-fold lower than their 0- 
alanine-substituted analogs 4 and 5, respectively (Table 1). Thus, the 2-p-2-y-2-p-2 motif is 
essential for high-affinity recognition in these cases. In contrast, polyamide 3 having no 
imidazoles beyond the second position of a polyamide subunit binds its match site with an 
equilibrium association constant ~20-fold higher than its p-alanine-substituted analog 6 (Table 

1). 

The following model is consistent with our data: 1) In the 2-P-2-y-2-p-2 polyamides 4 
and 5 the flexible p-alanine linkers correctly position the imidazole residues following them for 
specific hydrogen bond contacts with their target guanine bases, while in 5-y-5 polyamides 4 
and 5 the analogous hydrogen bonds cannot form. 2) In contrast, for both 2-p-2-y-2-P-2 
polyamide 6 and polyamide 5-y-5 3, the pyrroles beyond the second positions within 
polyamide subunits are correctly positioned, and 3 binds with lower affinity than 6 due to its 
greater conformational entropy. 

These results indicate that the optimal binding positions, or "binding register," is 
different for imidazole and pyrrole residues, consistent with previous results which suggest 
that, for subunits composed entirely of rings (i.e. without p-alanine linkers), imidazole residues 
go out of register after 2-3 residues with substantial drops in affinity, while pyrrole residues go 
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out of register after 5 rings with a gradual leveling of affinity before a substantial drop at 8 
rings. Finally, these results indicate that the 2-P-2-y-2-(i-2 and 5-y-5 motifs are complementary 
for recognition of 7 base pair sequences, with the optimal motif dependent on the sequence 
composition of the desired target site. 

5 

Relative Effects of Covalent Linkage with y-Aminobutyric Acid. 

Equilibrium association constants for the "unlinked dimer motif polyamides 7, 8, and 
9 were measured to allow comparison with their corresponding y-aminobutyric acid-linked 
polyamides 3, 4, and 6. Polyamides 4 and 6, which are composed of 2-y-2 subunits, bind with 
10 ~6000-fold and -1000-fold higher affinity, respectively, relative to their respective unlinked 
analogs 8 and 9. Polyamide 3, which is composed five-ring subunits, binds with -100-fold 
higher affinity than its unlinked analog 7. The significantly larger increase in affinity upon 
linkage with y-aminobutyric acid for 8 and 9 relative to 7 is consistent with the greater 
structural rigidity and consequently greater preorganization of 7. 

15 

Exact Binding Sites and Binding Orientations. 

MPE»Fe^ footprinting experiments carried out with the four polyamides 4, 5, 6, and 3, 
which have subnanomolar binding affinities, confirm that these compounds specifically bind 
their respective seven base pair match sites (Figures 9a and 10a). Affinity cleavage 

20 experiments were performed with the EDTA-polyamides 4-E, 5-E, 6-E, and 3-E to identify the 
location of the C-termini of these polyamides when bound to their target sites (Figures 6, 9b, 
10b). The symmetric polyamides 4-E, 6-E, and 3-E produce cleavage patterns at both sides of 
their binding sites as expected, consistent with two distinct binding orientations (Figure 9c). It 
appears that, in general, side-by-side polyamide-DNA complexes preferentially bind in the 5* 

25 to 3' (N- to C-terminus) orientation. Recent results of studies with hairpin polyamides are 
consistent with formation of 3' to 5 '-oriented polyamide-DNA complexes, and suggest that 
such "reversed orientation" complexes typically have affinities -10-fold lower than analogous 
5' to 3' -oriented complexes. Consistent with this trend, polyamide 4 binds the "reversed 
orientation" match site 5'-TCGTCGA-3' with an affinity -10-fold lower than it binds the site 

30 5'-TGCTGCA-3' (see footnote to Table 1). Inconsistent with this trend, however, the 
assymmctric polyamide 5-E produces a cleavage pattern (Figure 10b) consistent with the 
polyamide binding preferentially in the reversed orientation (3' to 5', N- to C-terminus) as 
illustrated (for the parent compound 5) in Figure 10c. 
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Effect of Physiological Salt and Temperature. 

Quantitative DNase I footprinting experiments indicate that increasing the equilibration 
temperature from 24 °C to 37 °C, and changing the solution conditions from standard 
polyamide assay conditions (10 mM Tris-HCl, 10 mM KC1, 10 mM MgCl 2 , 5 mM CaCl 2 , pH 
7.0 at 24 °C) to conditions modeling those encountered within a typical mammalian cell (140 
mM KC1, 10 mM NaCl, 1 mM MgCl 2 , 1 mM spermine, pH 7.2) has little effect on polyamide 
binding affinity (Table 2). 



Table 2. Equilibrium association constants (M* n ) of 
polyamide 4 for its match site 5'-TGCTGCA-3V 



Buffer* 


Temp. 


5'-aTGCTGCAtt-3' 


A 


24 °C 


1.5 x 10 10 (0.3) 




37 °C 


8.4 x 10 9 (23) 


B 


24 °C 


1.9 x 10 l0 (0.6) 




37 °C 


1.1 x 10 10 (0.2) 



'Values reported arc the mean values from at least three 
DNase 1 footprint liiraition experiments. The standard 
deviation for each value is indicated in parentheses. 

"Buffer A: 10 mM Tris-HCl. 10 mM KG. 10 mM 
M"CU. and 5 mM CaCU. pH 7.0 at 24 °C; Buffer B: 10 
mM HEPES-HC1. 140 mM ICCl. 10 mM NaCl. 1 mM 
MgCl : , 1 mM spermine. pH 7.2. 



Example 3: 

Binding of Polyamides to HIV-1 Promoter Sequences 

Sequence-specific DNA-binding small molecules that can permeate human cells could 
potentially regulate transcription of specific genes. Pyrrole-imidazole polyamides were 
designed to bind DNA sequences proximal lo binding sites for the cellular transcription factors 
TBP and LEF-1 utilized by HIV-1 for RNA synthesis. These synthetic ligands inhibit HIV-1 
transcription in cell-free assays and virus replication in isolated human peripheral blood 
lymphocytes. The ability of small molecules to target predetermined DNA sequences located 
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within RNA polymerase II promoters suggests a general approach for regulation of gene 
expression, as well as a mechanism for the inhibition of viral replication. 



In Example 1, above, it was demonstrated that an eight-ring hairpin polyamide targeted 
5 to a specific region of the transcription factor TFIIIA binding site inhibits 5S RNA gene 
transcription by RNA polymerase 111 in Xenopus kidney cells. See also Gottesfeld, J.M., et al., 
Nature 387, 202 (1997). Using a similar approach, it has been found that polyamides can 
inhibit HIV-1 transcription in cell-free assays and viral replication in human lymphocytes. 



10 The H1V-1 enhancer / promoter element contains binding sites for the cellular 

transcription factors Ets-1, LEF-1, NF-icB and SP1 along with a canonical TBP binding site 
(TATA element) and an initiator sequence (Figure 11, taken from K. A. Jones & B. M. Peterlin 
Annu. Rev. Biochem. 63, 717 (1994)). Jones & Peterlin (1994) describe the composition of the 
HIV-1 enhancer region sequence. Binding sites for such transcription factors are not optimal 

15 polyamide target sequences because they are found in the promoters of many protein-coding 
genes. However, the sequences immediately flanking these transcription factor binding sites 
vary from gene to gene. In some cases, transcription factor flanking sequences are conserved 
for a particular gene providing an address for gene-specific, targeting of common transcription 
factors. For example, the sequences immediately upstream and downstream from the HIV-1 

20 TATA element are of the form 5'-WGCWGCW-3' (where W = A or T). At least one copy of 
the 5'-WGCWGCW-3' sequence is located adjacent to the HIV-1 TATA element in all 
sequences of the HIV-1 LTR found in the EiYLBL and Genbank data bases K. Freeh, R. Brack- 
Werner, T. Werner describe the common modular structure of viral LTR's. Virology 224, 256 
(1996) However, the 5 ' -TG CTG C AT AT A AG C A G CT- 3 ' TATA element is found only in 

25 strains and isolates of HIV-1 and in no other sequence in either data base. The activity of the 
HIV-1 promoter, and especially Tat-induced activity, is critically dependent upon this 
sequence. For example, point mutations in the TATA flanking region reduces activated 
transcription by 10-fold without effecting basal transcription. Tat-transactivation is essential 
for HIV-1 RNA synthesis and virus replication. B. Berkhout, K.-T. Jeang describe TAT 

30 induced expression of the HIV-1 long terminal repeat J. Virol 66, 139 (1992); H. S. Olsen, C. 
A. Rosen, describe contribution of the TATA motif to the TAT-mediated transcriptional 
activation of HIV-1 gene expression. 1 Virol, 66, 5594 (1992). M. R. Sadaie, T. Benter, F. 
Wong-Staal. describe site directed mutagenesis of the 2-trans regulatory genes of HIV-1. 
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Science 239, 910 (1988). Although the possibility of mutational change cannot be precluded, it 
would expected that targeting the fflV-1 TATA flanking sequence with a polyamide might 
prove effective for inhibition of HIV-1 transcription and virus replication. A 5'-WGCWGCW- 
3' sequence which is not conserved in all strains of HIV-1 is located adjacent to the Ets-1 
binding site. An additional search identified a 5'-WGWWCW-3' sequence adjacent to the 
binding site for LEF-1 which was conserved for most reported strains of HIV-1. In order to 
regulate pol II transcription of only HIV-1, polyamides were designed to bind immediately 
adjacent to rather than directly to the binding sites for TBP and LEF-1 (Figure 11). 

DNase footprint titrations showed that polyamide 1 bound to the site adjacent to the 
LEF-1 binding site with an apparent dissociation constant of 0.1 nM (Figure 12, site Pl-1). The 
labeled DNA was incubated with the following concentrations of polyamide 1 for 45 min prior 
to the addition of LEF-1 DBD to a final concentration of 8 nM (in the reactions of lanes 7 to 
14)- no polyamide (lanes I and 7); 10 P M (lanes 2 and 8); 30 P M (lane 9); 0. 1 nM (lanes 3 and 
10)- 0 3 nM (lane 1 1); 1 nM (lanes 4 and 12); 3 nM (lanes 5 and 13); 10 nM (lanes 6 and 14). 
After a second incubation period of 45 min, the samples were subjected to DNase I digest™ 
and analyzed by gel electrophoresis. The location of polyamide sites Pl-1 to PM, LEF-1 sites 
L1-L3 the LEF-1 DNase hypersensitive (HSS) site and the start-site for transcription (+1) are 
shown along the side the autoradiogram. Three additional potential binding sites for polyamide 
1 are present in the HIV-1 promoter ( 5'-TGTACT-3') and coding sequence (5'-AGATCT-3') 
and polyamide 1 also binds these sequences (Pl-2 to PM, Figure 12). In agreement with 
previous published studies, recombinant LEF-1 protein (recombinant DNA binding domain) 
bound to its target site with an apparent dissociation constant of 1.5 nM (as determined by 
DNase footprint titrations). Two additional LEF-1 sites are found in the HIV-1 promoter and 
coding sequences (sites L2 and L3, Figure 12). Importantly, polyamide 1 proved to be an 
effective inhibitor of LEF-1 binding to each of these sites with a Ki of 0.1 nM for the enhancer 
site Importantly, inhibition was observed either when the polyamide was incubated with the 
DNA before adding LEF-1 or after preincubation of the DNA with LEF-1. In similar DNase 
footprinting experiments, the mismatch polyamides did not inhibit LEF-1 binding even at a 
, 100-fold higher concentration that thai needed for inhibition by polyamide 1 . 
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Based on the results described above, it was expected that polyamide inhibition of LEF- 
1 binding would inhibit LEF-1 -dependent transcription in an appropriate in vitro assay system. 
The effects of polyamide 1 and mismatch polyamide 2 on HIV-1 transcription were tested in an 
in vitro system consisting of a cell-free extract prepared from cultured human lymphoid H9 
cells. This whole-cell extract contains high levels of LEF-1 protein but supports only low 
levels of transcription, suggesting a limitation for other transcription components in this 
extract. The H9 cell extract was supplemented with the HeLa cell-derived nuclear extract in 
order to obtain high levels of transcription. The H9 cell extract stimulates HIV-1 transcription 
-2.5 to 3-fold over the level of transcription observed with the HeLa extract alone and 
immunodepletion of LEF-1 protein from the H9 extract abolishes this activated transcription. 
Polyamide 1 is an effective inhibitor of HIV-1 transcription in this system: 50% inhibition of 
transcription is obtained in polyamide titration experiments between 10 and 30 nM polyamide 
1 in the reaction. Polyamide 1 fails to inhibit HIV-1 transcription in the LEF-1 -depleted 
extract or with the HeLa extract alone. Similarly, LEF-1 depletion had no effect on CMV 
MIEP transcription. The effects of the mismatch polyamide 2 on HIV-1 and CMV 
transcription were tested as additional controls. No potential binding sites for either polyamide 

I or 2 are present in the CMV MIEP sequence. As expected, polyamide 1 fails to inhibit CMV 
transcription and mismatch polyamide 2 fails to inhibit either HIV-1 or CMV transcription. 

Notably, HIV-1 transcription was not inhibited by polyamide 1 at concentrations as 
high as 1 \iM even though binding sites for the polyamide occurred near the start-site for 
transcription (-2 to -7) and within the transcribed sequence (+20 to +25). A polyamide was 
designed and synthesized in inhibition studies with the human cytomegalovirus (CMV) 
immediate early promoter. A potential binding site for polyamide 3 (Figure 1 above) is located 
near the start-site for CMV transcription. Similar to our results with polyamide 1 and HIV-1 
transcription, polyamide 3 had no effect on CMV transcription in our reconstituted assay 
system. These results with both the HIV-1 and CMV promoters suggest that RNA polymerase 

II can initiate transcription and read through DNA bound with a polyamide in the minor 
groove. Thus, a polyamide must interfere with an essential DNA-binding protein to be an 
inhibitor of transcription. 
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Another well-studied minor groove DNA-binding protein is the TATA-box binding 
protein [TBP]. X-ray crystallographic studies of the TBP-DNA complex have clearly shown 
that TBP binds DNA in the minor groove and that binding results in a severe distortion of the 
DNA duplex (unwinding and bending). 

Since TATA elements are found in many protein-coding genes (with the exception of 
the so-called "housekeeping" genes), the TATA element itself is not a good candidate for a 
polyamide target sequence. However, the sequences immediately flanking TATA boxes are 
unique to each individual gene with little or no homology between genes. Thus, by analogy 
with our LEF-1 results, it would be expected that targeting the sequence immediately adjacent 
to a TA TA element with a polyamide might inhibit TBP binding and thus inhibit transcription 
of that gene by interfering with assembly of the RNA polymerase II transcription complex. 
Note that it is the binding of the TBP subunit of TFIID that nucleates the assembly of the RNA 
polymerase II transcription machinery for TATA-containing genes. 

As a model system the polyamide 1 binding site was introduced immediately adjacent 
to the HIV-1 TATA element by oligonucleotide-directed mutagenesis. Thus the TATA region 
sequence was changed from 5'-CATATAAGCAGCT-3' (WILD-TYPE) to 5'- 
CATATAAGTACTT-3' (MUTANT, polyamide 1 site underlined). 

The effect of the polyamide on the formation and stability of the TBP-DNA complex 
was determined using a gel mobility shift assay. As observed in previous gel mobility shift 
assays with recombinant TBP, both monomer and dimer TBP-DNA complexes are observed 
with both the wild-type and mutant TATA-box DNA fragments. Polyamide 1 inhibited TBP 
binding to the "mutant" HIV-1 promoter DNA probe but did not significantly inhibit TBP 
binding to the wild-type HIV-1 promoter probe (Figure 13 A). Similarly, polyamide 1 inhibited 
basal transcription from the "mutant" HIV-1 promoter but not from the wild-type promoter in a 

reconstituted transcription system containing a HeLa cell nuclear extract (Figure 13B, below). 

Approximately 70% inhibition of basal RNA polymerase II-mediated transcription was 

observed in this assay with polyamide 1 at a concentration of 100 nM. Additionally, neither of 

the mismatch polyamides inhibited TBP binding or transcription. 
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Based on these results, another polyamide was designed and synthesized to specifically 
target the HIV-1 TATA element as described in Example 2 (Figure 14, below): As shown 
below, this polyamide (ImPy-P-ImPy-y-ImPy-p-ImPy-p-Dp, called HIV-1 in Figs. 14-116 and 
polyamide 1 in Figs. 18-24) was designed to recognize the sequences immediately 5' and 3 f to 

5 the HIV-1 TATA element. The recognition sites for polyamide HIV-1 are of the form 5- 
WGCWGCW-3" (where W = A or T). Since a pyrrole-pyrrole pair can recognize either an A-T 
base pair or a T -A base pair, polyamide HIV-1 will recognize both the 5 1 and 3* flanking 
sequences of the HIV-1 TATA element. Binding of the polyamide was confirmed by DNase I 
footprinting and an apparent dissociation constant of 0.05 nM (K a = 2 X 10 10 M" 1 ) was 

10 determined for both sites (Figure 14). This polyamide inhibits HIV-1 transcription in an in 
vitro transcription assay with the lymphoid cell nuclear extract and a wild-type HIV-1 
promoter but not transcription of a control template lacking the binding site (the CMV 
immediate early promoter). A quantitative representation of the data is shown in Figure 15. 
Fifty percent inhibition of HIV-1 transcription is observed at approximately 50 nM polyamide 

15 in the reaction. This concentration corresponds to only a 16-fold excess of polyamide over 
specific binding sites in the HIV-1 plasmid DNA (3 nM). 

A corresponding control polyamide was also synthesized (Imlm-p-Imlm-y-PyPy-P- 
PyPy-P-Dp, named HIV-2 in Figure 16 and polyamide 2 in Figures 18-24). This polyamide 

20 differs from the HIV-1 polyamide only in the placement of the imidazole and pyrrole amino 
acids and recognizes the TATA box region of the HIV-1 LTR with at least 100-fold reduced 
affinity relative to the HIV-1 polyamide (K a = 2 X 10 8 M" 1 ). In Figure 16, the structures of 
polyamides HIV-1, ImPy-p-ImPy-y-ImPy-P-ImPy-p-Dp and HIV-2, Imlm-P-Imlm-y-PyPy-P- 
PyPy-P-Dp, are shown along with polyamide binding models for the HIV-1 TATA box region. 

25 The filled and unfilled circles represent imidazole and pyrrole rings, respectively, the curved 
line represents y-aminobutyric acid, and the diamond represents P-alanine. Single hydrogen 
bond mismatches are highlighted. 

As noted above, the sequences immediately flanking TATA boxes are unique to each 
30 individual gene with little or no homology between genes. The sequences immediately 5' and 
3' to the HIV-1 TATA element are of the form 5'-WGCWGCW-3' (where W = A or T). A data 
base search revealed that at least one copy of the 5 ! -WGCWGCW-3' sequence is located 
adjacent to the HIV-1 TATA element in all sequences of the HIV-1 LTR found in the EMBL 



WO 98/35702 



PCT/US98/02444 



-42- 

and GenBank data bases. Furthermore the 5'-TGCTGCATATAAGCAGCT-3' TATA element 
and the other prevalent H1V-1 TATA element (5'-TGCTGCATAAAAGCAGCC-3') are found 
only in the various strains and isolates of HIV- 1 and in no other reported sequence in either 
GenBank or the EMBL data base. 

These polyamides were next used in DNase I footprinting experiments and gel mobility 
shift assays with TBP and an HIV-1 LTR restriction fragment or TATA box oligonucleotides 
(as described above). As expected, the match polyamide HIV-1 inhibited TBP binding to the 
HIV-1 TATA element bat the mismatch polyamide HIV-2 did not inhibit TBP binding to this 
same DNA fragment (Figure 19A). DNase I footprint titrations of polyamides HIV-1 and -2 
were preformed in the presence or absence of recombinant human TBP (rhTBP) (- or + TBP). 
A radiolabeled HIV-1 LTR restriction fragment was incubated for 30 min with the following 
concentrations of polyamides prior to addition of rhTBP where indicated: no polyamide, lanes 
2, 6, 10, 14; 2.5 nM polyamide, lanes 3, 7, 1 1, 15; 5 nM, lanes 4, 8, 12, 17; 10 nM, lanes 5, 9, 
13, 17. After an additional 30 min incubation, samples were digested with DNase I. G+A 
chemical sequencing reactions are shown in lanes 1 and 18. The extent of the footprints 
generated by polyamide HIV-1 and TBP, respectively, are indicated at the sides of the 
autoradiogram. 

Complete inhibition of TBP binding was observed at a polyamide concentration of 2.5 
nM, which represents only a five-fold excess of polyamide over DNA binding sites in this 
assay. The HIV-1 polyamide did not inhibit TBP binding to an oligonucleotide corresponding 
to the TATA box region of the adenovirus major late promoter (data not shown). The half-life 
of the HIV-1 polyamide was determined in an experiment in which the polyamide-DNA 
complex was first formed and then challenged with a large molar excess of unlabeled DNA. 
Samples were taken for digestion with DNase after various incubation times and the apparent 
half-life of the HIV-1 polyamide-DNA complex is in excess of 2.5 hours. 

Figure 1 7 illustrates the binding of several classes of polyamides to the DNA sequence 
adjacent to the HIV-1 TATA box (Figure 17, sections la-lc) and adjacent to the Ets-1 / LEF-1 
binding sites. As described above, the polyamide HIV-1 (Figure 17, la) is a hairpin molecule 
that binds to a seven base pair sequence of the form S'-WGCWGCW-^ (where W = A or T). 
A longer twelve base pair sequence can be bound by an antiparallel dimer of the non-hairpin 
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polyaniide ImPy-p-PyPyPyPy-P-ImPy-p-Dp (Figure 17, lb). An even longer sixteen base pair 
sequence can be bound by an antiparallel dimer of the hairpin polyamide ImPy-P-ImPy-p- 
PyPyPyPy-p-IniPy-P-ImPy-p-Dp (Figure 17, lc). Similarly, a six base pair sequence between 
the Ets-1 binding site and the LEF-1 binding site can be bound by the polyamide ImPyPyPy-y- 
5 ImPyPyPy-P-Dp (Figure 17, 2a). The same sequence plus a portion of the LEF-1 binding site 
extending a total of nine base pairs can be bound by the polyamide ImPy-P-ImPyPyPy-y- 
ImPyPyPy-P-PyPy-p-Dp (Figure 17, 2b). 

Example 4: 

10 Inhibition of TBP Binding by Polyamide Ligands 

Figure 18 schematically depicts the HIV-1 promoter and DNA-binding sites for 
polyamide hairpins designed to target the HIV-1 promoter. In Figure 18 A, the DNA binding 
sites for hairpin polyamides designed to target the HIV-1 enhancer and promoter are illustrated, 

15 showing the binding of the polyamide ImPyPyPy-y-ImPyPyPy-p-Dp, here designated 
polyamide 3, also labeled above as polyamide 1 in Figures 1-4 and 12-13. As before, in the 
binding model illustration, the shaded and unshaded circles represent imidazole (Im) and 
pyrrole (Py) rings, respectively, the curved lines represent y-aminobutyric acid (y), and the 
diamonds represent p-alanine (p). The shaded bar schematic of the HIV-1 enhancer and 

20 promoter shows nucleotide positions -170 to the transcription start site at +1. The binding sites 
for the transcription factors USF, Ets-1, LEF-1, NF-kB, SP1 and TFIID (TBP) are indicated. 
The binding sites for polyamides 1, called HIV-1 above, and 3, called HIV-2 above, are 
indicated. The structures of polyamides ImPy-p-ImPy-y-ImPy-P-ImPy-P-Dp (1), Imlm-P- 
Imlm-y-PyPy-p-PyPy-p-Dp (2), ImPyPyPy-y-ImPyPyPy-P-Dp (3), and ImPyPyPy-y- 

25 PyPyPyPy-P-Dp (4), are shown in Figure 18 B, where Dp = dimethyaminopropylamide. 
Binding models and measured dissociation constants (K^) are shown. Mismatched base pairs 
are highlighted. 

It is known that binding of the TBP subunit of TF1ID in the minor groove nucleates 
30 assembly of the RNA polymerase II transcription machinery for TATA-containing genes. In 
general. S. K. Burley, R. G. Roeder describe the biochemistry and structural biology of the 
transcription factor TF11D Ann. Rev. Biochem. 65, 769 (1996); C. P. Verrijzer, R. Tjian 
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describe TAFS which mediate transcription activity and promoter selectivity TIBS 21, 338 
(1996); R. G. Roeder describes the role of general initiation factors in transcription by RNA 
polymerase II TIBS., 21. 327 (1996). 

According to the pairing rules for DNA recognition, the 5 f -WGCWGCW-3' sequence 
adjacent to the HIV-1 TATA element is targeted by hairpin polyamide 1 having sequence 
composition ImPy-p-ImPy-y-ImPy-P-ImPy-p-Dp (Figure 18). The imidazole-pyrrole pair 
(Im/Py) targets a G*C base pair and Py/Im targets OG. Since the p/p pairing recognizes both 
A*T and T»A base pairs polyamide 1 is expected to bind both the 5' and 3* flanking sequences 
of the HIV-1 TATA element (Figure 18A). White, S., Baird, E. E. & Dervan, P. B. describes 
the pairing rules for recognition in the minor groove of DNA by pyrrole-imidazole polyamides. 
Chem. & Biol 4, 569-578 (1997); Swalley, S. E., Baird, E. E. & Dervan, P. B. describe the 
sequence specificity of the p/p pairing Chem Eur. 1 3, 1600 (1997). Quantitative footprint 
titration experiments reveal that polyamide 1 binds both TBP sites as well as a S'-AGCTGCA- 
3' match site overlapping the binding site for Ets-1 with an equilibrium dissociation constant 
(Kd) of 0.05 nM (Figure 19A, lanes 3-5). For DNase I footprinting a 400-bp restriction 
fragment containing the HIV-1 enhancer and promoter regions was isolated from pHIV LTR- 
CAT plasmid DNA (obtained from Dir. K. A. Jones of The Salk Institute, La Jolla, CA) (P.L. 
Sheridan et ai } describe the sequence composition of the plasmid pHIV LTR-CAT Genes Dev. 
9, 2090 (1995)). Labeled DNA was incubated with polyamide (45 minutes) and digested with 
DNase 1 under single-hii conditions. 

Regions of polyamide protection were determined by analysis of the digestion products 
on a 6% denaturing polyacrylamide sequencing gel. Storage phosphorimage analysis of the 
data was used to obtain dissociation constants for the binding reactions (using Kodak Storage 
Phosphor Screens and a Molecular Dynamics SF Phosphorlmager). Recombinant human TBP 
(rhTBP) was obtained from Promega and was used in footprinting and gel mobility shift 
experiments as recommended. The reaction mixtures also contained 100 ng of poly dG-poly 
dC per 10 ^L gel shift reaction or 50 jiL footprinting reaction. For gel mobility shift assays, 
6% polyacrylamide gels (20:1, acrylamide to bisacrylamide) contained 44 mM Tris-borate, pH 
8.3, 1 mM EDTA, 4 mM MgCl2 and 0.02% (v/v) NP-40. The same buffer was used for 

electrophoresis of 1 mm thick gels at 1 5 volts/cm for 2 hours at 4 °C. 



WO 98/35702 



PCT/US98/02444 



-45- 

The HIV-1 TATA region double-stranded oligonucleotide, used as probe for the gel 
shift assay, had the top-strand sequence: 5'- 

CGTCCCTCAGATGCTGCATATAAGCAGCTGCTTTTTGCCTGTACTGGGTC-3', and the 
complementary bottom-strand sequence. The adenovirus major late promoter TATA region 
5 oligonucleotide has the top-strand sequence S'-GATCGGGGGCTATAAAAGGGGGTGGG- 
3\ and the complementary bottom-strand sequence. Each DNA strand (50 ng) was labeled in 

32 

separate reactions with y- P-ATP and T4 polynucleotide kinase and the double-stranded 
oligonucleotides were prepared by annealing equal molar amounts of the two complementary 
strands. Gel shift reactions contained 15 fmol of double stranded oligonucleotide (1.5 nM) 
10 and, where indicated in the figure legends, 50 pg of rhTBP (0.14 nM). Footprinting reactions 
contained the HIV-1 DNA probe (0.5 nM), and polyamide and 5 ng of rhTBP (3 nM) as 
indicated. 

The inhibition of TBP binding to the HIV-1 TATA element by polyamide 1 is shown in 
15 Figure 19. Figure 19A shows DNase I footprint titrations of polyamides 1 and 2 in the 
presence or absence of rhTBP. The radiolabeled HIV-1 LTR restriction fragment was 
incubated for 30 min with the following concentrations of polyamide 1 or 2 prior to addition of 
rhTBP where indicated (+): no polyamide, lanes 2, 6, 10, 14; 2.5 nM polyamide, lanes 3, 7, 11, 
15; 5 nM, lanes 4, 8, 12, 16; 10 nM, lanes 5, 9, 13, 17. After an additional 30 min incubation, 
20 samples were digested with DNase I. G+A chemical sequencing reactions are shown in lanes 1 
and 18. The location of the transcription start site, and the extent of the footprints generated by 
polyamide 1 and TBP (+1), are indicated at the sides of the autoradiogram. Binding sites for 
polyamide 1 are located adjacent to the TATA element and additional sites are found 
overlapping the binding site for the transcription factor Ets-1 (Figure 18A) and in the vector 
25 DNA sequence (upstream of position -1 70). 

A mismatch control polyamide ImIm-P-ImIm-y-PyPy-fJ-PyPy-(3-Dp (2) which differs 
only in ihe placement of imidazole and pyrrole amino acids binds the HIV-1 TATA box region 
with 100-fold reduced affinity relative to polyamide 1. TBP binds the HIV-1 TATA element 
30 with a of -1-3 nM. DNase footprinting assays reveal that polyamide 1 inhibits TBP 
binding at a concentration of 2.5 nM, which represents only a five-fold excess of polyamide 
over DNA binding sites in this assay (Figure 19A., lane 7). Mismatch polyamide 2 fails to 
inhibit TBP binding (Figure 19A., lanes 15-17). 
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A gel mobility shift assay which distinguishes the TBP-DNA complex from free DNA 
and polyamide-DNA complexes was carried out. The results are illustrated in Figure 19B. 
Radiolabeled HIV-1 TATA box (lanes 1-5 and 11-15) or adenovirus major late promoter 
TATA box duplex oligonucleotides (lanes 640) (11) were incubated with the following 
concentrations of polyamide prior to addition of rhTBP, where indicated (+): no polyamide, 
lanes 1, 2, 6, 7, 11, 12; 50 nM polyamide, lanes 3, 8, 13; 100 nM, lanes 4, 9, 14; 200 nM, lanes 
5, 10, 15. After an additional 30 min incubation, samples were subjected to electrophoresis. 
The positions of free DNA probes (F) and the monomer and dimer TBP-DNA complexes (B) 
are indicated alongside the figure. Polyamide 1 inhibits TBP binding to a double stranded 
oligonucleotide corresponding to the HIV-1 TATA box region, while no inhibition is observed 
for control polyamide 2 (Figure 13B). Additionally, polyamide 1 does not inhibit TBP binding 
to the TATA box region of the adenovirus major late (AdML) promoter (5'- 
GGGGGCT ATA AAAGGGGGT-3 ') which contains mismatch (underlined) flanking 
sequences. The half-life of the polyamide 1-DNA complex was determined by competition 
experiments to be in excess of 2.5 hours. 



Example 5: 
Inhibition of LEF-1 binding. 

Polyamides were designed and synthesized to target a sequence adjacent to the binding 
site for LEF-1, a cellular transcriptional activator used by HIV-1. LEF-1 is a member of the 
HMG family of minor-groove binding proteins. J. J. Love, et al. describe the structure of the 
LEF-1 -DNA complex. Nature 376, 791 (1995); J. Kim, F. Gonzales-Scarano, S. Zeichner, J. 
Alwine, Describe replication of HIV-1 containing linker substitution mutations in the -201 to - 
130 region of the long terminal repeat J. Virol 67, 1658 (1993);.In addition to acting as an 
architectural transcription factor, LEF-1 possesses a strong frans-activation domain and has 
been shown to be essential for viral transcription and replication in lymphoid cells. The LEF-1 
binding site is immediately flanked on one side by the sequence 5'-AGTACT-3\ According to 
the polyamide pairing rules, this sequence should be bound by hairpin polyamide 3 of sequence 
composition ImPyPyPy-y-lmPyPyPy-P-Dp which has already been characterized. Trauger, 
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J.W., Baird, E. E. Dervan, P.B. describes the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996). 

Quantitative footprint titration experiments reveal that polyamide 3 binds the 5- 
5 AGTACT-3' site adjacent to the LEF-1 binding site with Krf of about 0.06 nM. (site P3-1 in 
Figure 20A). DNase I footprint titration experiments were performed as described for TBP 
above. Three additional polyamide 3 binding sites present in the HIV-1 enhancer/promoter 
restriction fragment were bound with Krf about 1 nM: S'-TGTACT-S' (located at -7 to -2, P3-3 
in Figure 20A); 5-AGATCT-3' (located at +20 to +25, P3-4 in Figure 20A) and a single base 

10 mismatch site 5*-TCTACA-3' (located at -106 to -111, P3-2 in Figure 20A). DNase I 
footprint ing failed to detect any high affinity sites for mismatch polyamide 4 on the HIV-1 
enhancer/promoter fragment (Figure 20B). Three sites on the HIV-1 promoter/enhancer 
restriction fragment are bound by the 86-amino acid LEF-1 DNA binding domain (DBD), sites 
LI, L2 S and L3 (Figure 20A) with K</ = 1 .4, 5.8, and 4.9 nM, respectively. Recombinant LEF-1 

15 protein containing the 86 amino acid DNA-binding domain (DBD) was the generous gift of J. 
Love (Scripps Research Institute) and was expressed and purified as described. J. J. Love, et al. 
describe the structure of the LEF-1 -DNA complex. Nature 376, 791 (1995). LEF-1 was diluted 
into a buffer containing 10 raM Hepes-OH, pH 7.5, 100 mM KC1, 1 mM dithiothreitol, 1 mM 
MgCl2, 10% (v/v) glycerol and 250 ng/mL BSA. 

20 

DNA binding reactions were performed in the same buffer (without BSA) and also 
contained 250 ng of poly dG-poly dC and the HIV-1 DNA probe at 50 pM in a final volume of 
50 \il For assessing LEF-1 occupancy by DNase footprinting, the intensity of a protein- 
induced DNase I hypersensitive site was used as a hallmark of LEF-1 binding. The LEF-1 

25 footprint at site LI, characterized by a marked DNase I hypersensitive site (HSS), clearly 
changes to the polyamide footprint in the presence of match polyamide 3. LEF-1 binding is 
inhibited 50% at a polyamide 3 concentration of approximately 60 pM. Thus, polyamide 3 
located in the minor groove immediately adjacent to site LI inhibits LEF-1 DBD binding to 
this site. Polyamide 3 only inhibits LEF-1 binding to sites L2 and L3 at markedly higher 

30 polyamide concentrations (3 nM and above, lanes 13-14). Inhibition was observed either by 
adding the polyamide to the DNA before LEF-1 or after preincubation of the DNA with LEF-1, 
consistent with the relative dissociation constants for the two binding reactions. Mismatch 
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polyamide 4, which binds to the 5'-AGTACT-3' sequence with >100-fold reduced affinity, 
fails to inhibit LEF-1 binding (Figure 20B & C). 

The inhibition of LEF-1 binding to the HIV-1 enhancer by polyamide 3 is shown in 

5 Figure 20. In Figure 20A, the 3'- 32 P end-labeled HIV-1 LTR restriction fragment was 
incubated with 10 pM - 1 0 nM polyamide 1 for 45 min prior to the addition of LEF-1 DBD to a 
final concentration of 8 nM (lanes 7 to 14). After an additional 45 min, the samples were 
subjected to DNase I digestion and analyzed by gel electrophoresis. The polyamide 
concentrations were: no polyamide (lanes 1 and 7); 10 pM (lanes 2 and 8); 30 pM (lane 9); 0.1 

10 nM (lanes 3 and 10); 0.3 nM (lane 1 1); 1 nM (lanes 4 and 12); 3 nM (lanes 5 and 13); 10 nM 
(lanes 6 and 14). In Figure 20 B, mismatch polyamide 4 does not bind to the the HIV-1 
enhancer/promoter. The labeled DNA was incubated with the following concentrations of 
polyamides for 45 min: no polyamide (lane 1), 10 nM polyamide 3 (lane 2), 1, 3, 10, and 60 
nM polyamide 4 (lanes 3-6). In Figure 20C, mismatch polyamide 4 does not inhibit LEF-1 

15 binding to the HIV-1 enhancer/promoter. The labeled DNA was incubated with the following 
concentrations of polyamide 4 for 45 min prior to the addition of LEF-1 DBD: 1 nM (lanes 1); 
3 nM (lanes 2); 10 nM (lanes 3); and, 30 nM (lanes 4). LEF-1 DBD was added to a final 
concentration of 8 nM and, after an additional 45 min incubation, foe samples were subjected 
to digestion with DNase I. The location of polyamide sites P3-1 to P3-4, LEF-1 sites LI to L3, 

20 the LEF-1 HSS site and foe start-site for transcription (+1) are shown alongside the 
autoradiogram. 



25 



30 



Example 6: 

Inhibition of RNA Polymerase II Transcription 
by Synthetic DNA-Binding Ligands 

The effects of polyamides 1 and 2 on HIV-1 transcription were tested in an in vitro 
transcription assay with a HeLa cell nuclear extract (Figure 21A). The effects of the 
polyamides on basal transcription by RNA polymerase II was monitored with a nuclear extract 
prepared from HeLa cells (J. D. Dignam, P. L. Martin, B. S. Shastry, R. G. Roeder, Describe 
eukaryolic gene transcription with purified components Meth. Enzymol. 101, 582 (1983)). The 
human lymphoid cell line H9 (ATCC HTB 176) was grown in suspension culture in RPMI 
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medium (Bio Whittaker) supplemented with 10% fetal calf serum (Tissue Culture Biological). 
Whole cell extracts of H9 cells in log phase growth were prepared by hypotonic lysis and 
contained 20 mg/ml protein, as determined by the Bradford reaction. Run-off RNA transcripts 
of -300 bases (CMV MIEP linked to a guanine-less cassette, plasmid pElB-GL and -500 
5 bases (pHlV LTR-CAT) were obtained with EcoRI-digested plasmid DNA (100 ng per 25 nL 
reaction). 

Polyamide-DNA complexes were allowed to form at ambient temperature for 30 min 
prior to addition of the nuclear extract. Transcription complexes were allowed to form for 1 h 

10 at 30 °C prior to a transcription step for 1 h at 30 °C with 10 \xCi of a- 32 P-ATP, 10 \iM 
unlabeled ATP, and 600 jiM of the remaining unlabeled nucleoside triphosphates. RNA was 
purified by extraction with RNAzol (TelTest, Friendswood, TX) and analyzed by 
electrophoresis on a denaturing (8.3 M urea) 6% polyacrylamide gel. Autoradiograms were 
obtained by exposure of the dried gel to Kodak BioMax film with DuPont Cronex Lightning 

15 Plus intensifying screens for 1 to 18 h at -80°C. Relative levels of transcription were estimated 
by storage phosphorimage analysis. 

Polyamide 1, which targets the HIV-1 TATA box, inhibits basal transcription from the 
HIV-1 promoter mediated by the general transcription factors SP1 and RNA polymerase II 

20 (Figure 21 A, lanes 2-4). HIV-1 transcription was inhibited 50% in the presence of between 50 
and 100 nM polyamide 1 in several independent experiments. Polyamide 1 does not inhibit 
transcription from the CMV major intermediate early promoter (MIEP), which contains a 
mismatched TATA-flanking sequence (5'-GAGGTCTATATAAGCAGA-3 5 ). The mismatch 
polyamide 2 does not inhibit transcription from either promoter (Figure 21 A). Titrations were 

25 performed of polyamide 1 over a wide range of concentrations (1 to 200 nM) with both the 
HIV-1 and CMV templates in the same reaction. Under these conditions, 50% inhibition of 
HIV-1 transcription was observed at 30 nM polyamide 1, which corresponds to a 3-5 fold 
excess of polyamide over binding sites. No inhibition of CMV transcription was observed even 
at the highest concentrations of polyamide 1 (Figure 2 IB). Polyamide 1, which has an 

30 additional binding site in the HIV-1 enhancer at nucleotide positions -153 to -147 (overlapping 
the Ets-1 site at -149 to -142), inhibits the DNA binding activity of the Ets-1 transcription 
factor. 
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The effects of polyamides 3 and 4 on HIV-1 transcription in an in vitro system 
consisting of a cell-free extract prepared from cultured human lymphoid H9 cells supplemented 
with HeLa cell extract were determined. The H9 extract contains high levels of LEF-1 protein 
but was found to support only low levels of transcription, suggesting a limited amount of other 
transcription components in this extract. The H9 cell extract was supplemented with small 
amounts of a HeLa cell-derived nuclear extract in order to obtain high levels of transcription 
(Figure 21C, lane 1). The H9 HeLa cell extract stimulates HIV-1 transcription 2.5-3-fold over 
the level of transcription observed with the HeLa extract alone. Immunodepletion of LEF-1 
protein from the H9 extract abolishes this activated transcription (Figure 21C, compare lanes 1 
and 6). 

For immunodepletion and western blot analysis the H9 cell extract was depleted of 
LEF-1 protein with antibody to LEF-1 pi e-bound to protein A sepharose beads. A mixture of 5 
Hi of antiserum and 50 ul of a 1 :1 (v/v) slurry of protein A sepharose in transcription buffer 
supplemented with 2.5 mg/ml bovine serum albumin (BSA) was incubated on a rotator at 4 °C 
for 1 hour. For mock immunodepletion, an equivalent volume of beads was treated identically 
without antibody. Unbound antibody and BSA were removed by brief centrifugation and the 
beads were washed three times with 50 ul of transcription buffer. For immunodepletion, the 
packed protein A beads were incubated with 10 ul of H9 cell extract, 45 ul of transcription 
buffer and 7.5 ul of 50 mM MgCl2 on a rotator at 4 °C for 1 hour. The beads were then 
pelleted by brief centrifugation and the supernatant was tested for transcription activity. The 
efficiency of immunodepletion was determined by subjecting the depleted and mock-depleted 
extracts to SDS-PAGE and western blotting and was found to be greater than 95 %. The blot 
was probed with antibody to LEF-1 (diluted 1:2500, kindly provided by K. Jones, Salk 
Institute, La Jolla, California) and detected by enhanced chemiluminescence (Amersham ECL 
kit). Kodak Bio-Max film was used for detection. 

Polyamide 3 inhibits HIV-1 transcription in this system (lanes 2-5) with a 50% 
reduction of transcription observed at 10-30 nM polyamide. Polyamide 3 fails to inhibit HIV-1 
transcription in the LEF- 1 -depleted extract (lanes 7-10). The activity of the CMV MIEP was 
observed in both the mock-depleted and LEF-1 depleted H9 cell extract, with the result that 
LEF-1 depletion had no effect on CMV transcription. The effect of the mismatch polyamide 4 
on HIV-1 transcription (Figure 21D, lanes 4 and 5) and polyamides 3 and 4 on CMV 
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transcription (Figure 2 ID, lanes 6-10) were examined as additional controls. No potential 
binding sites for either polyamide 3 or 4 are present in the CMV MIEP sequence. As expected, 
polyamide 3 fails to inhibit CMV transcription (Figure 21D, lanes 7-8). The mismatch 
polyamide 4 fails to inhibit either HIV-1 or CMV transcription (lanes 4-5 and 9-10, 
5 respectively). 



In Figure 21 A the inhibition of basal transcription with polyamide 1 is demonstrated. DNA 
templates containing the HIV-1 promoter (lanes 1-4, 9-12) and the CMV major immediate early 
promoter (lanes 5-8, 13-16) were incubated with the following concentrations of polyamide 1 

10 (lanes 2-4, 6-8) or polyamide 2 (10-12, 14-16) prior to the addition of a HeLa cell nuclear extract 
(16): no polyamide, lanes 1, 5, 9, 13; 50 nM polyamide, lanes 2, 6, 10, 14; 100 nM, lanes 3, 7, 11, 
15; 200 nM, lanes 4, 8. 12, 16. In Figure 21B the relative levels of HIV-1 transcription/CMV 
transcription are plotted as a function of polyamide 1 concentration. Data were obtained from 
mixed template reactions containing both DNA templates as described in Figure 21A. In Figure 

15 21C the inhibition of polyamide 3 on LEF-1 -activated transcription is shown. Transcription 
reactions were performed with EcoRI-digested plasmid pHIV LTR-CAT and the H9 whole cell 
extract (2 ul) supplemented with 1 \i\ of the HeLa nuclear extract (lanes 1-5) or with 2 nl of the 
LEF-depleted H9 extract and the HeLa extract (lanes 6-10). Plasmid DNA was incubated with 
polyamide 3 for 15 min prior to addition of cell extracts and other reaction components. The final 

20 concentrations of polyamide in the reaction were: no polyamide (lanes 1 and 6), 10 nM (lanes 2 
and 7), 30 nM (lanes 3 and 8), 100 nM (lanes 4 and 9) and 300 nM (lanes 5 and 10). In Figure 21 
d, transcription with mismatch control polyamide 4 and a control CMV template. Plasmid DNA 
was incubated with polyamide 3(10 nM, lanes 2 and 7; 100 nM, lanes 3 and 8) or polyamide 4 (10 
nM, lanes 4 and 9; 100 nM, lanes 5 and 10) for 15 min at ambient temperature prior to addition of 

25 cell extracts and other reaction components. 

Notably, polyamide 3 does not inhibit basal transcription with the HeLa nuclear extract 
although binding sites for this polyamide are present at the start-site for transcription and 
within the HIV-1 mRNA coding sequence present in plasmid pHIV-CAT. These observations 
30 suggest that RNA polymerase II can transcribe DNA with a polyamide bound in the minor 
groove and that polyainides are only inhibitory to transcription when these compounds 
interfere with the DNA binding activity of a required transcription factor. 
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Example 7: 
Inhibition of RNA Pol II Transcription in 
Human Cells by Synthetic DNA-Binding Ligands 

The polyamides of the present invention have been demonstrated to inhibit HIV-1 
transcription in virus-infected human cells. In general, M. E. Klotman, S. Kim, A. Buchbinder, 
A. DeRossi, D. Baltimore, F. Wong-Staal describe the kinetics of expression of multiply 
spliced RNA in early HIV-1 infection of lymphocytes and monocytesj°roc. Natl. Acad. Sci. 
U.S.A. 88, 5011-5015 (1991). Since there are multiple spliced and unspliced species of HIV-1 
RNA with different turn-over kinetics, viral transcription was not monitired by measuring 
levels of RNA. Instead, the effects of the polyamides on the levels and kinetics of HIV-1 
replication in isolated human peripheral blood mononuclear cells (PBMC) was observed in 
culture. PBMC were infected with the T cell-tropic HIV-1 strain WEAU1.6, or the 
macrophage-tropic strain SF162. S. J. Clark et al describe high titers of cytopathic virus in 
plasma of patients with symptoms of primary human imunodeficiency virus type-1 infection. 
N. Engl. J. Med. 324, 954 (1991); P. Borrow, et al., describe antiviral pressure exerted by 
HIV-1 specific cytotoxic T-lymphocytes (CTLS) during primary infection demonstrated by 
rapid selection ofCTL escape virus Nature Med 3, 205 (1997); C. Cheng-Mayer, D. Seto, M. 
Tateno, J. A. Levy describe biologic features of HIV-1 that correlate with virulence in the 
hostScience 240, 80 (1988); C. Cheng-Mayer, C. Weiss, D. Seto, J. A. Levy, Describe isolates 
of HIV-1 from the brain may not constitute a special group of the AIDS virus. Proc. Natl. 
Acad. Sci. USA 86, 8575 (1989). Polyamides were added to the culture medium and the levels 
of HIV-1 p24 viral capsid protein in the culture media were determined on subsequent days 
using a standard ELISA assay. 

Human PBMC were separated from whole blood collected from normal adult 
volunteers by density gradient centrifugation as described. D. E. Mosier, et al. describe HIV-1 
infection of human-PBL-SCID mice Science 251, 791 (1991); D. Mosier, R. Gulizia, P. 
Maclsaac, B. Torbett, J. Levy, describe rapid loss of CD4+ T-cells in human-PBL-SCID mice 
by noncytopalhic HIV isolates Science. 260, 689 (1993). Donors were provided by the General 
Clinical Research Center of The Scripps Research Institute, which is supported by NIH grant 
MOl RR00833. Human PBMC were activated with 2 ug/mL PHA and 20 units/mL of IL-2 
for 2-3 days prior to HIV-1 infection. Each culture of 5 x 10 5 PBMC was infected with 10 3 
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tissue culture infectious doses of HIV-1 for 24 hours. Free virus was removed by washing the 
cells in medium, and polyamides added to the culture. In the experiment shown in Figure 22C, 
polyamides were added 24 hours prior to virus exposure, and were continuously present 
thereafter. Vims replication in culture was measured by HIV-1 p24 viral capsid antigen 
5 ELISA (DuPont Medical Products, Boston, MA). 

Assays of HIV-1 replication representative of five replicate experiments with five 
human PBL donors are shown in Figures 22 and 23, illustrating results from two experiments 
employing different donors. Each experiment is controlled by using polyamides which differ 
10 by single atomic substitution from the inhibitory polyamides. In control PBMC cultures with 
no added polyamide, viral replication resulted in increasing p24 levels between day 4 and day 
10 of culture (Figure 23). Addition of mismatch control polyamides 2 and 4 had no effect on 
the level of virus in the medium, either alone (Figure 22, A, B, & D) or in combination (Figure 
22C). 

15 

in contrast, both polyamides 1 and 3 added to the culture medium reduced p24 levels in 
a dose-dependent manner (Figure 22A & B). Polyamide 1 at 1 jiM concentration caused an 
80% reduction in virus (Figure 22 A), while polyamide 3 at 10 \iM concentration caused a 60% 
reduction (Figure 22B). These inhibitory effects were clear at 6 or 8 days of culture, but 
20 became less pronounced at later times. The two polyamide ligands, when used individually, 
delay the appearance of virus rather than absolutely inhibit virus production. 

The combination of polyamides 1 and 3 at 1 \iM each can act in synergy to reduce viral 
p24 levels to below the threshold of detection (<10 pg/ml; greater than 99.9% inhibition of 
25 viral replication) (Figure 22D, 23A & C), and were as effective as 1 ^M azidothymidine (AZT) 
in blocking HIV-1 replication (Figure 22D). The macrophage-tropic SF162 isolate, which 

replicates in both macrophages and CD4+ T lymphocytes, was more difficult to inhibit with 
single polyamides, but the combination of 1 \iM polyamides 1 and 3 was able to reduce and 
eventually block its replication (Figure 23A). These results demonstrate that cell-permeable 
30 synthetic DNA ligands can effectively inhibit HIV-1 replication in isolated human 
lymphocytes in culture and suggest that inhibition of virus replication is due to inhibition of 
transcription factor-DNA interactions and gene transcription of viral and possibly cellular 
genes by RNA polymerase II. The observed polyamide inhibition of virus replication is likely 
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due to interference with the DNA binding activities of TBP and Ets-1 by polyamide 1 and the 
binding activity of LEF-1 by polyamide 3. The inhibitory effects of polyamides singly or in 
combination was not due to obvious toxicity. No significant decrease in cell viability was 
apparent in PBMC cultures treated with polyamides 1 and 3 for 10 days, in contrast to 90% 
mortality observed for PBMC cells treated with 1 jiM AZT for the same period (Figure 23B & 
D). Cell viability was not impacted by polyamide treatment, but was reduced by AZT 
treatment. 

The combination of polyamides 1 + 3 inhibited HIV-1 replication, but the closely 
related polyamides 2 + 4 did not. The simplest explanation for this finding is that polyamides 
1+3 inhibited HIV-1 RNA transcription in cells as well as in the in vitro assays (Figure 22), 
but it is possible that inhibition of cellular genes involved in T cell activation could result in an 
indirect effect on HIV-1 replication. To further assess this possibility, a sensitive RNAase 
protection assay was performed for transcripts of a number of cytokine genes, including IL-2, 
IL-5, and IL-13 whicli differ by only single base mismatches from the target sequences 
flanking the TATA box in the HIV-1 LTR. Four other cytokine genes that lack binding sites 
for either polyamide 1 or 3 in their promoters were also examined (Figure 24A). The results 
(Figure 24B) show that exposure of activated human PBMC to a combination of either 
polyamides 1 + 3 or 2 + 4 (1 uM each) for 6 days failed to inhibit cytokine RNA expression. 
This lack of inhibition of cytokine gene transcription suggests that the polyamides reduce virus 
replication in cells by a direct effect on HIV-1 RNA transcription. 

Our present studies have utilized polyamides designed to target DNA sequences 6-7 bp 
in length and have shown that these compounds are effective inhibitors of gene transcription in 
cell- free systems and, vidci infra, in human cells. Because sequences of these lengths would be 
highly redundant in the human genome, it had seemed likely that these ligands would have 
deleterious effects on cell metabolism due to interference with the activity of cellular genes. 
However, the results described here indicate that a battery of polyamides which recognize 6-7 
bp sequences will be sufficient for gene-specific regulation in vivo. It is interesting to compare 
these small molecule transcription factors to eukaryotic transcriptional regulatory proteins 
which also recognize multiple sequences of similar length in tandem in order to increase 
functional specificity. The observations that polyamides do not interfere with pol II elongation, 
and that polyamides can bind simultaneously with certain major groove binding proteins 
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should further enhance gene-specificity. M. G. Oakley, M. Mrksich, P. B. Dervan describe 
simultaneous binding of polyamide in the minor groove with a protein in the major groove. 
Biochemistry 31, 10969 (1992). 

5 As shown in Figure 16, polyamides are not limited to 6-7 base-pair recognition, but can 

bind as cooperative (timers to sequences up to sixteen base pairs in length. Trauger, J.W., 
Baird, E. E. Mrksich, M., Dervan, P.B. describes the recognition of 9-13 base pairs of DNA by 
a P-alanine linked extended polyamide dimer. 

10 The specific inhibition of genes transcribed by RNA polymerase II represents an 

important first step toward asking whether cell-permeable small molecule transcription 
antagonists might regulate gene expression in complex organisms. TBP and sequences 
adjacent to the TATA element for inhibition of basal transcription by RNA polymerase II were 
chosen. Since most tissue-specific cellular genes and viral genes contain TATA elements, this 

15 approach is generally applicable for the inhibition of most target genes. 

The inhibition of HIV-1 replication in peripheral blood mononuclear cells by 
polyamides is shown in Figure 22. Panels A-D depict three separate experiments in which 
polyamides alone (A, B) or in combination (C, D) were added to cultures of human peripheral 

20 blood mononuclear cells (PBMC) stimulated 3 days earlier with phytohemagglutinin and 
interleukin-2. PBMC cultures were infected with the primary HIV-1 isolate WEAU 1.6 
(kindly provided by G. Shaw), and virus replication measured by release of p24 capsid antigen 
into the medium that was detected by antibody-capture ELISA (Dupont). Each experiment 
involved a separate PBMC donor. Values shown are for day 6 or day 8 after virus infection. 

25 When two polyamides were combined, the concentration shown is for each component of the 
mixture. In panel D, the combination of 1 fiM polyamide 1 + 1 \iM polyamide 3 cooperatively 
blocked virus infection and p24 release (below 10 pg/ml, the detection limit of the assay), as 
did the addition of 1 pM azidothymidine (AZT). Assays of p24 concentration were performed 
in duplicate and showed less than 5% variation from the mean value reported. 

30 

Figure 23 shows the kinetics of the inhibition of HIV-1 replication and effects on cell 
viability by a combination of polyamides added to PBMC cultures. PBMC were isolated as in 
Figure 22 and infected either with the macrophage-tropic HIV-1 isolate SF162 (panels A and 
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B) or the T-cell tropic isolate WEAU 1.6 (panels C and D). Polyamides 1 and 3 were added in 
combination at 1 uM each. Although each polyamide alone was not very effective in blocking 
replication of HIV-1 SF162 (data not shown), the combination of both polyamides was able to 
stabilize and later reduce virus p24 production to undetectable levels (panel A, filled triangles) 
while untreated cultures (open triangles) continued viral replication. Cell viability was 
determined at days 4 and 10 of culture by trypan blue exclusion. 

The combination of polyamides 1 and 3 caused a small decrease in viability compared 
to cultures that were untreated (panel B, filled versus open triangles), but parallel cultures 
treated with 1 uM azidoihymidine (AZT) showed a much more severe decline in cell viability. 
AZT completely inhibited vims replication of both HIV-1 SF162 and WEAU 1.6 at this 
concentration (data not shown). In contrast to these results, each polyamide alone partially 
inhibited replication of HIV-1 WEAU 1.6 (see Figure 23 A & B), and the combination of 
polyamides 1 and 3 blocked replication at all times examined (panel C, filled circles) compared 
to untreated cultures (open circles). Cell viability was higher in cultures treated with 
polyamides 1 and 3 than in untreated cultures (panel D, filled circles versus open circles), 
probably because the cytopathic effect of HIV-1 infection was completely reversed. AZT 
again showed substantial toxicity for cells. Assays of P 24 concentration were performed in 
duplicate and showed less than 5% variation from the mean value reported. Similar results 
were obtained in two additional experiments. 

A lack of inhibition of cytokine gene expression with polyamides targeted to the HIV-1 
promoter and enhancer sequence can be seen in Figure 24. In Figure 24A the sequences of 
the TATA-box region (taken from GenBank listings) of each of the cytokine/growth factor 
genes examined are shown with the TATA box in bold and the binding site for polyamide 1 
underlined. Single base mismatches are indicated in lower case. In Figure 24B, human 
peripheral blood mononuclear cells were cultured under the same conditions used for HIV-1 
infection. Cultures were either left untreated, or 10 uM of polyamides 1 + 3 or polyamides 2 + 
4 added. After six days, cells were harvested, total RNA extracted, and a ribonuclease 
protection assay performed with riboprobes specific for the indicated cytokines as well as for 
CD4, CDS and ribosomal protein L32. M. V. Hobbs, et al., describe patterns of cytokine gene- 
expression by CD4+ T-cells from young and old mice J. Immunol 150, 3602 (1993). After 
digestion with RNAse Tl, protected fragments were separated by polyacrylamide gel 
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electrophoresis, and the amount of labeled probe quantitated by Phosphorlmager analysis. 
Data are expressed as the intensity of each cytokine RNA relative to the intensity of the 
ribosomal L32 RNA band to standardize for RNA loading. There was no difference in the 
intensity of CD4 and CDS RNA bands between groups, indicating equivalent recovery of CD4 
5 and CDS T cells in all cultures. Similar results were seen when the cultured cells were 
analyzed after 10 days, although the levels of RNA for most cytokines had declined in both 
polyamide treated and untreated cells. 

10 

Example 8: 
Differential Inhibition of Ets-1 
Binding With Two Polyamides 

Ets-1 is another cellular transcription factor required for high levels of HIV-1 RNA 
15 synthesis (Figure 25). Figure 25 shows the schematic structures of polyamides targeted to the 
Ets-l/LEF-1 region of the HIV-1 enhancer and the sequence of the binding sites for these 
polyamides. Polyamides 1 and 2 have been described above in the context of the LEF-1 
inhibition experiments above . Polyamide 3 (ImPy-(i-ImPy-y-ImPy-p-ImPy-P-Dp) binds the 
sequence S'-TGCTGCA-S* with a Kd of 0.05 nM, while the mismatch polyamide 4 (Imlm-P- 
20 Imlm-y-PyPy-P-PyPy-p-Dp) exhibits 100 fold lower affinity for binding that site. The binding 
sites for polyamides 1 and 3 are located immediately downstream (polyamide 1) and upstream 
(polyamide 3) of the Ets-1 DNA recognition sequence in the HIV-1 enhancer region (Figure 
25B). Polyamide 3 also recognizes sequences immediately flanking the TATA box of the 
HTV-1 promoter, and was shown to inhibit DNA-binding of TBP and prevent basal 
25 transcription from this promoter. 

Schematic representation of the HIV-1 enhancer/promoter region with the DNA- 
binding sites for the transcription factors USF, Ets-1, LEF-1, NF-kappaB, Spl and the TATA- 
binding protein TBP. The positions of the polyamide binding sites are indicated by vertical 
30 arrows. The DNA-binding sites for polyamides 1 and 3 flanking the Ets-1 recognition site are 
shown in detail below. The Ets-1 binding site is boxed, the GGA core recognition sequence, 
that is conserved in most Ets-protein binding sites, is indicated by a small rectangle. The LEF-1 
binding site is underlined. 
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Since the Ets-1 binding site is flanked by and partially overlaps with both polyamide 1 
and polyamide 3 binding sites, whether Ets-1 binding could be inhibited by either one, or both 
of these two polyamides was tested by gel mobility shift experiments. When polyamides were 
preincubated with a radiolabeled double-stranded HIV oligonucleotide before adding Ets-1 
AN331, polyamide 1 had no effect on Ets-1 DNA-binding, even at a concentration as high as 
400 nM (Figure 26 lanes 3-5). Polyamide 3, however, prevented the Ets-l/DNA complex 
formation (Figure 26, lanes 9-11). When both polyamides were combined, the degree of 
inhibition was very similar to the one observed with polyamide 3 alone (Figure 26, lanes 15- 
17). The two mismatch polyamides 2 and 4 did not prevent complex formation, either alone or 
combined (Figure 26, lanes 6-8, 12-14 and 18-20). A titration experiment revealed that 
polyamide 3 inhibited Ets-1 AN331/DNA complex formation by 50% at a concentration of 
approximately 6 nM, nearly complete inhibition was achieved between 50 and 200 nM 
polyamide. Mismatch polyamide 4 had virtually no effect in the same concentration range. 

A double-stranded, labeled oligonucleotide corresponding to the Ets-1 binding site with 
flanking regions within the HIV-1 enhancer/promoter was used as a probe in gel mobility shift 
assays. Polyamides were preincubated with the DNA before addition of Ets-1, where indicated. 
The concentrations of polyamides were 100 nM (lanes 3, 6, 9, 12, 15 and 18), 200 nM (lanes 4, 
7, 10, 13, 16 and 19), and 400 nM (lanes 5, 8, 1 1, 14, 17 and 20). AN331 protein was added at 
a concentration of 12 nM. Positions of the free probe (F) and bound probe (B) are indicated. 
Figure 27A shows a representation of an autoradiogram of a representative gel mobility shift 
assay showing the inhibitory effect of polyamide 3 on AN331 binding. The concentration of 
AN331 was constant (12 nM in lanes 2-10), the probe concentration was 50 pM. The 
polyamide was preincubated for 20 min at room temperature before addition of the protein, 
followed by an additional 20-30 min incubation on ice. The polyamide concentrations were 
1.56, 3.125, 6.25, 12.5, 25, 50, 100, 200 nM in lanes 3 to 10, respectively. Figure 27B is a 
graphical representation of the decrease of the fraction of bound probe as a function of 
polyamide concentration. Closed squares represent the data points obtained for polyamide 3, 
open squares represent mismatch polyamide 4. Figure 27C is a graphical representation of 
the effect of polyamide 3 on Ets-1 binding when added at different time points during the 
binding reaction. Polyamide 3 was added to the probe either prior to addition of Ets-1 (closed 
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squares), or after addition of Ets-1 (open squares), or simultaneously with Ets-1 (closed 
triangles). 

Figure 28 is a representation of the results of DNase I footprint titration experiment 
5 with polyamides 1 and 3 in the absence (-) or presence (+) of 9.6 nM AN331 protein. 
Polyamides were incubated with the radiolabeled probe prior to addition of Ets-1. The 
polyamide concentrations were 0 nM (lanes 2, 6, 10, 14), 4 nM (lanes 3, 7, 11, 15), 20 nM 
(lanes 4. 8, 12, 16) and 100 nM (lanes 5, 9, 13, 17). Lanes 1 and 18 show a G+A sequencing 
ladder. The regions protected by polyamides and by Ets-1 are indicated alongside the 
10 sequencing ladder. Note the two DNase T hypersensitive sites characteristic of the Ets-1 
footprint. 



Polyamide 3 prevents Ets-1 DNA-binding while polyamide 1 coexists with Ets-1 on 
overlapping DNA binding sites: 

15 

A labeled DNA fragment derived from the HIV-1 enhancer was incubated with each 
polyamide either alone, or followed by addition of Ets-1 and a further 30 minute incubation. As 
expected, both- polyamides bound their target site with similar affinities, polyamide 1 even 
bound with a slightly higher affinity than polyamide 3, complete protection was observed with 

20 20 nM polyamide 1, and with 100 nM polyamide 3 (Figure 28, lanes 3-5 and 11-13). The Ets-1 
footprint is characterized by two DNase I hypersensitive bands appearing in the center and at 
the 5' boundary of the footprint. These hypersensitive sites disappear with the addition of 20 to 
100 nM polyamide 3 (Figure 28, lanes 7-9), but remain unchanged with the addition of 
polyamide 1 (Figure 28, lanes 15-17). The simultaneous presence of polyamide 1 and Ets-1 

25 results in a broadening of the footprint, corresponding to a combined footprint created by Ets-1 
and polyamide 1 (Figure 28, lanes 14-17), while the Ets-1 footprint is replaced by the 
polyamide 3 footprint (lanes 7-9). Based on the structure of the Ets family protein PU.l in 
complex with its DNA recognition element, it has been proposed that DNA recognition is 
mediated by both major groove and minor groove contacts. In the case of Ets-1, minor groove 

30 contacts would be expected to occur at the 5' end of the recognition element. This structural 
prediction is in complete agreement with our polyamide inhibition experiments. A polyamide 
located at the 5' end of the Ets-1 site is inhibitory to DNA binding whereas a polyamide 
located at the 3' end of the Ets-1 site is not inhibitory to DNA binding. 
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Example 9: 

Inhibition of TBP Binding and Basal Transcription with Polyamides Designed to Bind 
Sequences Adjacent to the TATA Element 

The previous Examples demonstrate that polyamides which bind sequences adjacent to 
the well-conserved TATA sequence upstream from the transcription start-site of messenger 
RNA-coding genes inhibit TBP binding and basal transcription by RNA polymerase II. This 
method is a useful general approach for modulation of gene expression. For example, specific 
polyamides have been designed and synthesized that bind to double stranded DNA adjacent to 
the TATA element of the human cytomegalovirus major immediate early promoter (CMV 
MIEP). Polyamide CMV-1, ImlmPyPy-y-ImPyPyPy-P-Dp, binds the identified target 
sequence 5'-AGGTCT-3\ where the last T of this sequence is the 5' T of the TATA element. 
Polyamide CMV-1 binds the CMV MIEP with an apparent dissociation constant of 1 nM and 
is an effective inhibitor of TBP binding and basal transcription. Appropriate mismatch 
polyamides do not inhibit either TBP binding or transcription. 

In another example, a polyamide was designed to bind immediately downstream of the 
TATA element found in the human Her-2/neu breast cancer oncogene promoter. This 
polyamide, Her2-1, of composition ImPy-P-Pylm-y-PyPy-P-PyPy-P-Dp, binds the sequence 
5'AGAATGA-3' (where the 5' A of this sequence is the 3' A of the TATA element) with an 
apparent dissociation constant of 20 pM and is an effective inhibitor of TBP binding and 
transcription. Her-2/neu is recognized to be over-expressed in several cancers, including 
human gynecologic adenocarcinomas, including those of the ovary, endometrium, breast, 
fallopian tube and cervix (Cirisano, F.D., & Karlan, B.Y., J. Soc. Gynecol. Jnvestig. 3 99-105 
(1996)). 

These additional data confirm the generality of this approach and demonstrate that 
polyamides located cither upstream or downstream of the TATA element are effective 
transcription inhibitors. Table 3, below, lists the gene promoters, TATA sequences and 
composition of the polyamides which have been shown to be inhibitors of TBP binding and 
transcription. 
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Table 3 

Polyamide Inhibition of TBP Binding and Basal Transcription by Targeting 
Sequences Adjacent to the TATA Element 


GENE 


TATA SEQUENCE 


POLYAMIDE 


Artificial 
TATA 


5 '-TATAAGTACTT-3 ' 


ImPyPyPy-y-ImPyPyPy-(3-Dp 


HIV-1 
Promoter 


5 '-TGCTGCATATAA-3 ' 
5'-TATAAGCAGCT-3' 


ImPy-p-Pylm-y-PyPy-P-PyPy-P-Dp 


CMV 
MIEP 


5 ' - AGGTCTATAA-3 ' 


ImlmPyPy-y-ImPyPyPy-P-Dp 


Her-2/neu 
oncogene 
(Breast Cancer) 


5'-TATAAGAATGA-3' 


ImPy-P-PyIm-y-PyPy-(3-PyPy-P-Dp 



Example 10: 

Anti-repression of Polymerase II Transcription by a Designed Ligand: 

5 

Pyrrole-imidazole polyamides can exert a positive effect on transcription by interfering 
with the activity of a specific repressor protein. The human cytomegalovirus (HCMV) IE86 
repressor protein is well suited for this study, because transcriptional repression is dependent 
on IE86 binding to its DNA target site. IE86 negatively regulates the major immediate early 

10 promoter (MIEP) of HCMV by binding to a sequence element (the cis repression signal, crs) 
located between the TATA box and the start of transcription. It was shown that IE86 exerts its 
negative effect on transcription by binding to the crs element, thereby blocking recruitment of 
RNA polymerase II. The polyamides that specifically recognize the crs, prevent IE86 from 
binding and relieve transcriptional repression, while a mismatch polyamide or a polyamide 

15 which binds to a nearby site, have no effect. Furthermore, occupancy of the crs element by a 
small polyamide is not sufficient to repress transcription, supporting the model that IE86- 
mediated repression results from steric interference with the recruitment of RNA polymerase 
II to the preinitiation complex. 

20 Figure 29 is a schematic representation of the sequence of the HCMV MIEP region 

from position -34 to +7. The TATA box and the repressor binding site (located at -14 to +1 
relative to the start site of transcription) are boxed. The polyamides are schematically 
represented at their respective DNA binding sites. The shaded and open circles represent 
imidazole and pyrrole rings, respectively, the hairpin junction is formed by y-aminobutyric 
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acid, and the diamond represents p-alanine. Polyamides 2 and 3 contain an additional amino 
group at the hairpin junction. The mismatch in polyamide 4 is boxed. Structures of five 
polyamides are shown schematically: ImPylmPy-y-PyPyPyPy-p-Dp (1), ImPyImPy-2,4D- 
PyPyPyPy-P-Dp (2), Im-p-ImPy-2,4D-PyPyPyPy-p-Dp (3), ImPyPyPy-y-PyPyPyPy-p-Dp 
(4) and ImlmPyPy-y-ImPyPyPy-p-Dp (5), which binds the sequence 5'-AGGTCT-3' adjacent 
to the TATA box. Im = imidazole, Py = pyrrole, y = y-aminobutyric acid, p = p-alanine, 2,4D 
= 2,4 diaminobutyric acid, and Dp =dimethylaminopropylamide. 

The eight ring polyamide 1, ImPylrnPy-y-PyPyPyPy-p-Dp, binds the six base pah- 
sequence 5-AGTGAA-3' within the IE86 binding site with an apparent dissociation constant 
of 1.4 nM. The predicted structure of this complex is shown in Figure 29 (polyamide 1). Two 
additional polyamides were synthesized that bind the same site, but with higher affinities: 
polyamide ImPylmPy-D-PyPyPyPy-p-Dp (polyamide 2, where 2,4D represents 2,4 
diaminobutyric acid) and Im-P-ImPy-2,4D-PyPyPyPy-P-Dp (polyamide 3) bind the sequence 
5'-AGTGAA-3' with Kjs of 1 nM and 0.25 nM, respectively. A single atom substitution in 
this molecule, changing an imidazole to a pyrrole (nitrogen to C-H; polyamide 4, shown in 
Figure 29), reduces the affinity of this compound by -30 to 100-fold for its target site. A 
control polyamide, ImImPyPy-7-ImPyPyPy-P-Dp (polyamide 5) binds the sequence 5'- 
AGGTCT-3' adjacent to the TATA box of the MIEP with a Kd of 1 nM. 

These compounds together with purified recombinant IE86 protein were used in DNase 
I footprint experiments with a singly end-labeled restriction fragment derived from the CMV 
major immediate early promoter. As predicted, polyamide 1 protects the sequence 5'- 
AGTGAA-3' in the center of the IE86 binding site. Approximately 50% protection is seen 
with 2 nM polyamide, and complete protection is obtained with 200 nM polyamide 1 (Figure 
30, lanes 3-5). Recombinant protein IE86 protects a region extending between positions -24 
and +3. Approximately 250 nM IE86 is required to give complete protection (Figure 30, lanes 
6 and 14). When polyamide 1 is preincubated with DNA for 15 minutes prior to addition of 
protein, the IE86 footprint is partially replaced by the polyamide 1 specific footprint at a 
concentration of 2 nM polyamide, and is completely replaced at a concentration of 200 nM 
polyamide (Figure 30, lanes 7-9). The control polyamide 5, which protects a site 5'-AGGTCT- 
3' immediately upstream of the TATA box, does not interfere with IE86 binding in a 
concentration range of 2-200 nM. Both the IE86-specific footprint and the polyamide 5 
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footprint coexist, even though they are only 9 base-pairs apart (Figure 30, lanes 14-17). A 
singly-end labeled restriction fragment was used in DNase I footprinting reactions. Reactions 
contained approximately 250 nM his-tagged IE86 where indicated, and polyamide 1 in the 
concentrations indicated at the top of the lanes. Lanes 1 and 18 represent a G+A sequencing 
5 ladder. The extent of the footprints created by polyamide 1, IE86 and the control polyamide 5 
are indicated by brackets alongside the autoradiogram. The location of the TATA box and the 
start of transcription are indicated. 

The same experiment was repeated with polyamides 2 and 3, which bind the same site 
10 5'-AGTGAA-3' with higher affinities than polyamide 1. A complete protection is seen with 5 
and 10 nM polyamide 3 (Figure 31, lane 4), which are also the concentrations required to 
completely inhibit IE86 from binding (Figure 31, lanes 8 and 9). A partial inhibition of IE86 
binding is detected at 1 nM polyamide 3 (Figure 31, lane 7). A similar result was obtained 
with polyamide 2 (data not shown). The mismatch polyamide 4 did not give a footprint on the 
15 CMV fragment in the concentration range used in these experiments (1 - 200 nM), and it did 
not interfere with IE86 binding (Figure 31, lanes 11-13 and 15-17). The IE86 concentration 
was approximately 250 nM. The concentrations of the polyamide are indicated in nM at the top 
of the lanes. The extent of the footprints for polyamide 3 and IE86 protein as well as the TATA 
box and transcription start site are shown at the side of the autoradiogram. 

20 

The effects of the polyamides on MIEP transcription were tested in an in vitro system 
consisting of a cell-free extract prepared from cultured human CEM cells. It is known from 
previous studies that IE86 inhibits transcription by binding to the cis repression sequence, and 
physically blocks the access of RNA polymerase II to the preinitiation complex. Whether 
25 polyamide 1, which has a relatively low binding affinity (in about the same range as IE86), 
could counteract the negative effect of IE86 on transcription was tested. Run-off transcription 
reactions were performed with linearized plasmid and CEM extract. The concentration of IE86 
that was needed to give approximately 75% inhibition of transcription was empirically 
determined. The polyamides were added at a concentration of 1 nM. 

30 

Figure 32 shows that IE86 inhibits CMV transcription by approximately 75% (lane 2). 
When polyamide 1 is preincubated with DNA prior to addition of IE86 protein, the level of 
transcription is restored to approximately 75% of the control (Figure 32, lane 3). Interestingly, 



WO 98/35702 



PCT/US98/02444 



-64- 

the presence of polyamide 1 bound to the crs in the absence of IE86, has no effect on 
transcription, even at a concentration of 1 \iM (Figure 32, lane 4). An unrelated polyamide, 
which has no binding site in the CMV promoter, has no effect on IE86-mediated repression, 
even at a concentration of 1 \iM (Figure 32, lane 5). Thus, polyamide 1 is able to specifically 
5 counteract transcriptional repression mediated by IE86 protein. 

The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing from the true spirit and scope of the invention. 

10 
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1 . A method of modulating the expression of a cellular or viral gene comprising the steps 
of: 

a. identifying a unique target DNA sequence adjacent to the binding site of at least one 
minor groove transcription factor protein; 

b. chosing a polyarnide having a subnanomolar affinity for the unique target DNA 

sequence; and 

c. contacting the unique target DNA sequence with a transcription inhibiting amount of 
the polyarnide. 

2. The method of claim 1 comprising the additional step of designing the polyarnide to be 
specific for the unique target DNA sequence. 

3. The method of claim 1 wherein the minor groove transcription factor protein is chosen 
from the group consisting of TFIIIA, TBP, LEF-1, and Ets-1. 

4. A method for modulating expression of a cellular or viral gene controlled by a minor 
groove transcription factor protein using a specifc polyarnide including paired 
carboxamide residues, comprising the steps of: 

a. identifying a.target sequence of double stranded DNA adjacent to the binding site 
of the minor groove transcription factor protein, the target sequence having the form 
5'-WNiN2 ... N m W-3', wherein N]N2 . . . N m is the sequence to be bound by 
carboxamide residues, wherein each N is independently chosen from the group A, 
G, C, and T, each W is independently chosen from the group A and T, and m is an 
integer having a value from 3 to 6; 

b. selecting a specific polyarnide that is selective for identified target DNA sequences 
5'-WNiN2 . . . N m W-3' and having the form 

X1X2 . . . X m -y-X( m + 1) . . . X(2m-l)X2m 

c. wherein X], X2, X m , X( m -1- 1), X(2m - 1)> and ^2m are carboxamide residues 
forming carboxamide binding pairs Xi/X2m, X2/X(2m-1)> X m /X(m + 1), and y is 
y-aminobuytic acid or 2,4 diaminobutyric acid and Dp is 
dimethylaminopropylamide, where m is an integer having a value from 3 to 6; and 

d. contacting the target sequence with a transcription inhibiting amount of the specific 
polyarnide. 
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The method of claim 4 further comprising the step of designing the specific polyamide 
of the form 

X1X2 • . • X m -y-X(m + 1) . . . X(2m-l)X2m- 
comprising the steps of: 

a. representing the identified target sequence as 5'-WaA . . . *W-3\ wherein a is a first 
nucleotide to be bound by the Xi carboxamide residue, ft is a second nucleotide to 
be bound by the X2 carboxamide residue, and x is the corresponding nucleotide to 
be bound by the X m carboxamide residue; 

b. defining a as A, G, C, or T to correspond to the first nucleotide to be bound by a 
carboxamide residue in the identified sequence; 

c. selecting Im as the Xi carboxamide residue and Py as the X 2m carboxamide 

residue if a = G; 

d. selecting Py as the Xi carboxamide residue and Im as the X2m carboxamide 
residue if a = C; 

e. selecting Hp as the Xi carboxamide residue and Py as the X 2m carboxamide 
residue if a = T; 

f. selecting Py as the X] carboxamide residue and Hp as the X2m carboxamide 

residue ifa = A; and 

g. repeating steps c - g for b through x until all carboxamide residues are selected. 

6. The method of claim 4 further comprising the step of replacing at least one pyrrole 
residue with a p-alanine residue. 

7. A method of inhibiting the replication of a pathogen by administering a transcription 
inhibiting amount of at least one polyamide compound. 

8. The method of claim 7 wherein the pathogen is chosen from the group consisting of 
viruses, bacteria, fungi and protozoans. 

9. The method of claim 7 wherein the pathogen is a retrovirus. 

10. The method of claim 7 wherein the pathogen is HIV-1. 

11. The method of claim 7 wherein the polyamide is chosen from the group consisting of 
ImPyPyPy-Y-ImPyPyPy-P-Dp, ImPy-P-ImPy-Y-ImPy-P-ImPy-P-Dp, ImPy-P- 
PyPyPyPy-p-ImPy-p-Dp, ImPy-P-ImPy-p-PyPyPyPy-P-IniPy-p-ImPy-P-Dp, ImPy-p- 
ImPyPyPy-Y-ImPyPyPy-p-PyPy-P-DP' ImlmPyPy-Y-ImPyPyPy-P-Dp, and mixtures 
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thereof wherein Im is N-methylimidazole, Py is N-methylpyrrole, y is y-aminobutyric 
acid, P is P-alanine, and Dp is dimethylaminopropylamide. 

12. A composition comprising: 

a. a transcription inhibiting amount of at least one polyamide chosen from the group 
consisting of ImPyPyPy-y-ImPyPyPy-P-Dp, ImPy-P-ImPy-y-ImPy-p-ImPy-p-Dp, 
ImPy-P-PyPyPyPy~P-ImPy-p-Dp, ImPy-P-ImPy-P-PyPyPyPy-p-ImPy-p-ImPy-P- 
Dp, ImPy-p-ImPyPyPy-y-ImPyPyPy-p-PyPy-P-Dp, ImlmPyPy-y-ImPyPyPy-P-Dp, 
and mixtures thereof wherein Im is N-methylimidazole, Py is N-methylpyrrole, y is 
y-aminobutyric acid, p is p-alanine, and Dp is dimethylaminopropylamide; and 

b. a pharmaceutically acceptable excipient. 

13. The composition of claim 12 wherein the composition is suitable for parenteral use. 

14. A method of improving the binding affinity of a polyamide selective for an identified 
target DNA in a viral genome comprising the step of replacing a carboxamide binding 
pair that does not include N-methyl-imidazole carboxamide with a carboxamide 
binding pair consisting of P-alanine paired with p-alanine. 

15. A method of inhibiting the binding of the zinc finger protein TFIEA to the 5S 
ribosomal RNA gene internal control region, comprising: 

contacting the 5S ribosomal RNA gene internal control region with an inhibiting 
amount of a polyamide of the formula ImPyPyPy-y-ImPyPy-p-Dp, wherein Im is N- 
methylimidazole, Py is N-methylpyrrole, y is y-aminobutyric acid, p is P-alanine, and 
Dp is dimethylaminopropylamide. 

16. A composition comprising a transcription inhibiting amount of at least one polyamide 
chosen from the group consisting of ImPyPyPy-y-ImPyPy-P-Dp, ImPy-p-ImPy-y- 
ImPy-P-ImPy-P-Dp and mixtures thereof and a pharmaceutically acceptable excipient 
suitable for the treatment of HIV- 1 infection. 

17. A method of treating a human patient with an HIV-1 infection comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least 
one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-p-Dp, ImPy- 
P-ImPy-y-ImPy-P-ImPy-P-Dp and mixtures thereof and a pharmaceutically acceptable 
excipient. 

18. A method of treating HTV-1 infected human blood cells in vitro comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least 
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one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-P-Dp, ImPy- 
p-ImPy-y-ImPy-p-ImPy-P-Dp and mixtures thereof. 

19. A composition comprising a transcription inhibiting amount of ImPy-P~ImPy-y-PyPy- 
P-PyPy-p-Dp and a pharmaceutical^ acceptable excipient suitable for the treatment of 

5 a cancer characterized by the over-expression of her-2/neu. 

20. A method of treatment of a adenocarcinoma of the ovary, endometrium, breast, 
fallopian tube and cervix comprising administering the composition of claim 19. 
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FIG. 22A FIG. 22B 
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gene sequence flanking TATA region 



HIV- 1 S'- TGPTGPA TAT A AGPAGCT GC'T-'V 

1L-5 5'- AGCIfl£ cTGAGAGTTTTTAACCATTACA-3' 

IL-2 5'-TT A AC A GT AT A A AT TGC A tCIC- 3 ' 

IL- 1 3 5'-TTGGGCCT ATAAAAGCTGCcAC-3' 

TNF-alpha 5'-AGGGACATATAAAGGCAGTTG-3* 

TGF-beta 5'-GGGGCTGTATTTAAGGACACCG-3' 

GM-CSF 5'-CTCTGTGTATTTAAGAGCTCTT-3' 

TNF-beta 5'-CCTCCTCTATAAAGGGACCTGA-3' 
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Specific ets-1 binding inhibition 
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